(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
27 December 2002 (27.12.2002) 




PCT 



mini! 


nun i! i M 1 1 in 


iiiiiiiiiii 


inn 



(10) International Publication Number 

WO 02/102984 A2 



(51) International Patent Classification 7 : C12N 

(21) International Application Number: PCT/US02/ 19051 

(22) International Filing Date: 14 June 2002 (14.06.2002) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

60/298,173 
60/311,686 
60/316,995 



14 June 2001 (14.06.2001) US 
10 August 2001 (10.08.2001) US 
4 September 2001 (04.09.2001) US 



(71) Applicant (for all designated States except US): 
SLOAN-KETTERING INSTITUTE FOR CAN- 
CER RESEARCH [US/US]; 1275 York Avenue, New 
York, NY 10021 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): RICHON, Victo- 
ria [US/US]; 160 Theodore Fremd Street, #A11, Rye, NY 
10580 (US). ZHOU, Xianbo [CN/US]; 43 Bradley Street, 
Dobbs Ferry, NY 10522 (US). RIFKIND, Richard, A. 
[US/USJ;425 East 58th Street, #48 A, New York, NY 10022 
(US). MARKS, Paul, A. [US/US]; 7 Rossiter Road, Wash- 
ington, CT 06793 (US). 



(74) Agents: BROOK, David, E. et al.; Hamilton, Brook, 
Smith & Reynolds, PC, 530 Virginia Road, P.O. Box 
9133, Concord, MA 01742-9133 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, 
VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAP1 patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, 
NE, SN, TD, TO). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations'' appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

00 
C\ 
<N 
O 

c5 

(57) Abstract: The present invention features substantially pure HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), an 
^ HDRP(ANLS) polypeptides, and isolated nucleic acid molecules encoding those polypeptides. The present invention also features 
^ vectors containing///)^ CP, HDAC9a t HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid sequences, and cells 
^ containing those vectors. 



(54) Title: HDAC9 POLYPEPTIDES AND POLYNUCLEOTIDES AND USES THEREOF 



WO 02/102984 



PCT/US02/19051 



-1- 



HDAC9 POLYPEPTIDES AND POLYNUCLEOTIDES AND USES THEREOF 

RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application No. 
5 60/298,173 filed on June 14, 2001, U.S. Provisional Application No. 60/3 1 1,686 
filed on August 10, 2001, and U.S. Provisional Application No. 60/316,995, filed on 
September 4, 2001. The entire teachings of the above applications are incorporated 
herein by reference. 
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The invention was supported, in whole or in part, by grant CA-0974823 from 
the National Cancer Institute. The Government has certain rights in the invention. 



15 



20 



BACKGROUND OF THE INVENTION 

The N-terminal tails of core histones are covalently modified by post- 
translational modifications, including acetylation and phosphorylation. Evidence 
suggests that these covalent modifications play important roles in several biological 
activities involving chromatin, e.g., transcription and replication. Histone 
deacetylases (HDACs) catalyze the removal of the acetyl group from the lysine 
residues in the N-teiminal tails of nucleosomal core histones resulting in a more 
compact chromatin structure, a configuration that is generally associated with 
repression of transcription. 

Five proteins and/or open reading frames in yeast (RPD3, HDA1, HOS1, 
HOS2 and HOS3) that share significant homology in the catalytic domain have been 
25 identified as HDACs based upon their sequence homology to human HDAC1. To 
date, eight HDACs have been identified in mammalian cells, and classified into two 
classes based on their structure and similarity to yeast RPD3 or HDA1 proteins. 
Recently, Sir2 family proteins that are structurally unrelated to the five proteins 
aforementioned have been identified as NAD-dependent HDACs. Class I HDACs 
30 are the yeast RPD3 homologs HDAC1, 2, 3, and 8, and are composed primarily of a 
catalytic domain. Class H HDACs are the yeast HDA1 homologs HDAC4, 5, 6; and 
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7. HDAC4, 5, and 7 contain a long non-catalytic N-terminal end and a C-terminal 
HP AC catalytic domain while HDAC6 has two HDAC catalytic domains. 

It has also been determined that histone deacetylases can be sensitive to 
small molecules, including trichostatin A (TS A), trapoxin, and butyrate. For 
5 example, the yeast RPD3 and HDA1 and mammalian HDAC1, 2, 3, 4, 5, 6, 7 and 8 
are sensitive to inhibition by trichostatin A (TSA). The Sir2 family HDACs, yeast 
HOS3 and Drosophila melanogaster dHDAC6, however, appear to be relatively 
insensitive to TSA. A class of hybrid bipolar compounds, such as suberoylanilide 
hydroxamic acid (SAHA) have also been shown to inhibit histone deacetylases and 

10 induce terminal differentiation and/or apoptosis in various transformed cells. 

Examples of such compounds can be found in U.S. Patent Nos. 5,369,108, issued on 
November 29, 1994, 5,700,81 1, issued on December 23, 1997, and 5,773,474, issued 
on June 30, 1998 to Breslow et al, as well as U.S. Patent Nos. 5,055,608, issued on 
October 8, 1991, and 5,175,191, issued on December 29, 1992 to Marks et al 9 the 

1 5 entire content of all of which are hereby incorporated by reference. 

The identification of the mechanisms by which histones are deacetylated, and 
the characterization of histone deacetylase function would be of great benefit in 
understanding how gene transcription is controlled, how the cell cycle is regulated, 
and how cells are signaled to undergo terminal differentiation and/or apoptosis. 

20 Elucidation of such mechanisms can lead to improved therapeutics for many 
diseases, in particular those characterized by cell proliferation or a lack of cell 
differentiation or apoptosis, for example, cancer. 

SUMMARY OF THE INVENTION 
25 The present invention relates to isolated or recombinant histone deacetylase 

polypeptides, and isolated histone deacetylase nucleic acid molecules encoding those 
polypeptides, as well as vectors and cells containing those isolated nucleic acid 
molecules. 

In one aspect of the invention, the isolated or recombinant histone 
30 deacetylase polypeptide is selected from a) an isolated or recombinant polypeptide 
comprising SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ 
ID NO: 10; and b) a polypeptide having at least 60% sequence identity with any one 
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of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 
10. In one embodiment, the isolated or recombinant bistone deacetylase polypeptide 
consists of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ 
ID NO: 10. In another embodiment, the isolated or recombinant bistone deacetylase 
5 polypeptide is mammalian; preferably, the isolated or recombinant bistone 
deacetylase polypeptide is human. 

hi another aspect, the invention features an isolated nucleic acid molecule 
selected from a) an isolated nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; b) a complement of an isolated ' 
10 nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, or SEQ ID NO: 9; c) an isolated nucleic acid encoding a bistone deacetylase 
polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or 
SEQ ID NO: 10; d) a complement of an isolated nucleic acid encoding a histone 
deacetylase polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
1 5 NO: 8, or SEQ ID NO: 1 0; e) a nucleic acid that is hybridizeable under high 

stringency conditions to a nucleic acid molecule that encodes any of SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or a complement thereof; or f) a ' 
nucleic acid molecule that is hybridizeable under high stringency conditions to a 
nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID 
20 NO: 7; and g) an isolated nucleic acid molecule that has at least 55% sequence 
identity with any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, or a complement thereof. In one embodiment, the isolated 
nucleic acid molecule consists of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, 
SEQ ID NO: 7, or SEQ ID NO: 9. In another embodiment, the isolated nucleic acid 
25 molecule is mammalian; preferably, the isolated nucleic acid molecule is human. 

In other aspects, the invention features a vector comprising the isolated 
histone deacetylase nucleic acid molecule described above, a cell comprising the 
vector, and a cell comprising the isolated histone deacetylase nucleic acid molecule 
described above. 

30 hi another aspect, the invention features a purified antibody that selectively 

binds a bistone deacetylase polypeptide described above. 
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In yet another aspect, the invention features a method of identifying a 
compound that modulates expression of a histone deacetylase nucleic acid molecule 
described above. The method comprises the steps of a) contacting the nucleic acid 
molecule with a candidate compound under conditions suitable for expression; and 
5 b) assessing the level of expression of the nucleic acid molecule. A candidate 
compound that increases or decreases expression of the nucleic acid molecule 
relative to a control is a compound that modulates expression of the nucleic acid 
molecule. In one embodiment, the method is carried out in a cell or animal. In 
another embodiment, the method is carried out in a cell free system. 
10 The invention also features a method of treating a cell proliferation disease, 

an apoptotic disease, or a cell differentiation disease, for example, cancers such as 
lymphoma, leukemia, melanoma, ovarian cancer, breast cancer, pancreatic cancer, 
prostate cancer, colon cancer, and lung cancer and myeloproliferative disorders, 
including polycythemia vera, essential thrombocythemia, agnogenic myeloid 

15 metaplasia, and chronic myelogenous leukemia in an individual, comprising 
administering a compound identified by the above method. 

In still another aspect, the invention features a method of identifying a 
compound that modulates the enzymatic activity of the histone deacetylase 
polypeptide described above. The method comprises the steps of a) contacting the 

20 polypeptide with a candidate compound under conditions suitable for enzymatic 
reaction; and b) assessing the activity level of the polypeptide. A candidate 
compound that increases or decreases the activity level of the polypeptide relative to 
a control is a compound that modulates the enzymatic activity of the polypeptide. In 
one embodiment, the method is carried out in a cell or animal. In another 

25 embodiment, the method is carried out in a cell free system. 

In yet another embodiment, the polypeptide is further contacted with a 
substrate for the polypeptide, wherein the substrate is selected from the group 
consisting of a cell proliferation disease binding agent, an apoptotic disease binding 
agent, and a cell differentiation disease binding agent. In one embodiment, the 

30 candidate compound is an inhibitor. In another embodiment, candidate compound is 
an activator. 
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In another aspect, the invention features a method of identifying a compound 
that modulates the transcriptional repression activity of the histone deacetylase 
polypeptide described above. The method comprises the steps of a) contacting the 
polypeptide with a candidate compound under conditions suitable for a 
5 transcriptional repression reaction; and b) assessing the transcriptional repression 
activity level of the polypeptide. A candidate compound that increases or decreases 
the transcriptional repression activity level of the polypeptide relative to a control is 
a compound that modulates the transcriptional repression activity of the polypeptide, 
hi one embodiment, the method is carried out in a cell or animal. In another 
1 0 embodiment, the method is carried out in a cell free system. 

In yet another embodiment, the polypeptide is further contacted with a 
substrate for the polypeptide, wherein the substrate is selected from the group 
consisting of a cell proliferation disease binding agent, an apoptotic disease binding 
agent, and a cell differentiation disease binding agent. In one embodiment, the 
1 5 candidate compound is an inhibitor. In another embodiment, candidate compound is 
an activator. 

In another aspect, the invention features a method of identifying a compound 
that modulates expression of a histone deacetylase nucleic acid molecule described 
above. The method comprises the steps of a) providing a nucleic acid molecule 

20 comprising a promoter region of the histone deacetylase nucleic acid molecule 
described above, or part of such a promoter region, operably linked to a reporter 
gene; b) contacting the nucleic acid molecule or with a candidate compound; and c) 
assessing the level of the reporter gene. A candidate compound that increases or 
decreases expression of the reporter gene relative to a control is a compound that 

25 modulates expression of the histone deacetylase nucleic acid molecule described 
above. In one embodiment, the method is carried out in a cell. 

In still another aspect, the invention features a method of identifying a 
polypeptide that interacts with a histone deacetylase polypeptide described above in 
a yeast two-hybrid system. The method comprises the steps of a) providing a first 

30 nucleic acid vector comprising a nucleic acid molecule encoding a DNA binding 
domain and the histone deacetylase polypeptide described above; b) providing a 
second nucleic acid vector comprising a nucleic acid encoding a transcription 
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activation domain and a nucleic acid encoding a test polypeptide; c) contacting the 
first nucleic acid vector with the second nucleic acid vector in a yeast two-hybrid 
system; and d) assessing transcriptional activation in the yeast two-hybrid system. 
An increase in transcriptional activation relative to a control indicates that the test 
5 polypeptide is a polypeptide that interacts with the histone deacetylase polypeptide 
described above. 

The invention also features a pharmaceutical composition comprising a 
histone deacetylase polypeptide described above. 

In addition, the present invention features a method of diagnosing a cell 
1 0 proliferation disease, an apoptotic disease, or a cell differentiation disease in a 

subject. The method comprises the steps of a) obtaining a sample from the subject; 
and b) assessing the level of activity or expression of the histone deacetylase 
polypeptide described above or the level of the nucleic acid molecule described 
above in the sample. If the level is increased relative to a control, then the subject 
15 has an increased likelihood of having a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease, and if the level is decreased relative to a 
control, then the subject has a decreased likelihood of having a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease. In one embodiment, 
the polypeptide level is assayed using immunohistochemistry techniques. In another 
20 embodiment, the nucleic acid molecule level is assayed using in situ hybridization 
techniques. 

Compounds and/or polypeptides identified in the above-described screening 
methods are also part of the present invention. 



25 DESCRIPTION OF THE FIGURES 

FIG. 1 is a schematic representation of the order in which FIGS. lA-lO 
should be viewed. 

FIGS. 1A-1C show the cDNA sequence of HDAC9 (SEQ ID NO: 1). The 
arrows and numbers in the HDAC9 sequence indicate exons. The boxed portion of 
30 the sequence indicates the HDAC domain. 



WO 02/102984 



-7- 



PCT/US02/19051 



FIGS. 1D-1G show the cDNA sequence of HDAC9a (SEQ ID NO: 3). The 
arrows and numbers in the HDAC9a sequence indicate exons. The boxed portion of 
the sequence indicates the HDAC domain. 

FIGS. 1H-1I show the cDNA sequence of HDRP(ANLS) (SEQ ID NO:9). 
5 FIGS. 1 J-1L show the cDNA sequence of HDAC9(ANLS) (SEQ ID NO:5). 

FIGS. 1M-10 show the cDNA sequence of HDAC9a(ANLS) (SEQ ID 

NO:7). 

FIG. 2 is a schematic representation of the order in which FIGS. 2A-2E 
should be viewed. 

1 0 FIG. 2 A shows the amino acid sequence of HDAC9 (SEQ ID NO: 2). 

FIG. 2B shows the amino acid sequence of HDAC9a (SEQ ID NO: 4). 
FIG. 2C shows the amino acid sequence of HDAC9(ANLS) (SEQ ID NO: 6). 
FIG. 2D shows the amino acid sequence of HDAC9a(ANLS) (SEQ ID NO: 

8). 

15 FIG. 2E shows the amino acid sequence of and HDRP(ANLS) (SEQ ID NO: 

10) . . t 

FIG. 3 is a schematic representation of the order in which FIGS. 3A-3C 
should be viewed. 

FIGS. 3A-3C show an amino acid sequence alignment of HDRP (SEQ ID 
20 NO: 1 1), HDAC9 (SEQ ID NO: 2), HDAC9a (SEQ ID NO: 4), and HDAC4 (SEQ 
ID NO: 12) polypeptides. Amino acid sequences of HDAC9 (GenBank Accession: 
AY032737; SEQ ID NO: 2) and HDAC9a (GenBank Accession: AY03273 8; SEQ 
ID NO: 4) are aligned with HDRP (GenBank Accession: B AA34464; SEQ D NO: 

1 1) and HDAC4 (GenBank Accession: NPJ)06028; SEQ ID NO: 12). The identical 
25 residues in all proteins are boxed with solid lines. The similar residues are boxed 

with dotted lines. 

FIG. 4 shows a schematic representation of the human HDAC9 gene 
structure. The striped boxes represent exons present in isoforms HDRP, HDAC9a, 
and HDAC9. The lines represent introns. Broken lines are used for larger introns 
JO (with size in base pair on top). The 5' untranslated region cDNA and coding region 
cDNA are represented here. Exons 1-12 encode a non-catalytic domain of the 
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polypeptides, and exons 14-21 encode the histone deacetylase catalytic domain of 
the polypeptides, which provide the polypeptides with deacetylase activity. 

FIG. 5 is a schematic representation of the order in which FIGS. 5A-5D 
should be viewed. 

5 FIGS. 5A-5D show the nucleic acid sequence of HDAC9, containing all 

exons expressed in the various isoforms of HDAC9, HDAC9a, HDAC9(ANLS), 
HDA C9a(ANLS), and HDRP(ANLS) of the present invention (SEQ ID NO: 1 3). 

FIG. 6A is a scanned imaged of a multiple human tissue Northern blot that 
was probed to determine mRNA expression of HDAC9 using a cDNA probe that 

10 recognizes both HDAC9 and HDAC9a. The tissues examined are lane 1, heart; lane 
2, brain; lane 3, placenta; lane 4, lung; lane 5, liver; lane 6, skeletal muscle; lane 7, 
kidney; and lane 8, pancreas. Positions of the RNA size marker in kilobases (kb) are 
indicated to the left of the blot. 

FIG. 6B is a scanned image of an electrophoretic gel showing the results of 

1 5 RT-PCR analyses of mRNA from the same tissues as examined in the Northern blot 
of FIG. 6A to determine the distribution of HDAC9 and HDAC9a mRNA among 
these tissues. PCR products were resolved by agarose gel electrophoresis and 
visualized by ethidium bromide under UV light. A 1-kb DNA ladder was run on 
both sides of the gel with the size (in kb) indicated on the left. On the right side, the 

20 expected products for HDAC9 and HDAC9a are indicated as 9 and 9a, respectively. 
FIG. 7 is a graph of HDAC enzymatic activity of HDAC anti-FLAG- 
immunoprecipitated proteins isolated from vector control, HDAC9-FLAG, and 
HDAC9a-FLAG transfected 293T cells, as measured in fluorescence units using 
FLUOR DE LYS™ as a substrate in the presence or absence of 1 \iM TSA. Results 

25 are shown as the mean of three independent assays. The inset is a scanned image of 
an anti-FLAG Western blot showing the amount of proteins used in the assay. V, 
Vector control; 9, HDAC9-FLAG; and 9a, HDAC9a-FLAG. 

FIG. 8 is a graph of HDAC enzymatic activity of HDAC anti-FLAG- 
immunoprecipitated proteins isolated from vector control, and HDAC9a-FLAG 

30 (treated with 2 pM SAHA or left untreated) transfected 293T cells, as measured by 
3 H-acetic acid released from 3 H-histones in the presence or absence of 2 |iM SAHA. 
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Vector control; HDAC9a, HDAC9a-FLAG; and HDAC9a+, HDAC9a-FLAG + 
SAHA. 

FIG. 9A shows a scanned image of a Western blot of 293T whole cell lysate 
and anti-FLAG immunoprecipitates from 293T cells transfected with vector, 
5 HDAC9-FLAG or HDAC9a-FLAG using antibodies against MEF2 and FLAG. Top 
panel, anti-MEF2 Western; bottom panel, anti-FLAG Western. L, 293T whole cell 
lysate; V, vector control IP; 9, HDAC9-FLAG IP; 9a, HDAC9a-FLAG IP. 

FIG. 9B is a graph showing the transcription level of p3XMEF2-Zwc in the 
presence or absence of pcDNA3 empty vector (-), pCMV-MEF2C, and/or a vector 
10 encoding pFLAG-HDAC9 or pFLAG-HDAC9a. P 3XMEF2-Z«c (lOOng) andpRL- 
TK (5 ng) were transfected into 293T cells with pcDNA3 empty vector (-) or with 
PCMV-MEF2C (100 ng) (+) along with the indicated amount of pFLAG-HDAC9 or 
pFLAG-HDAC9a. pFLAG empty vector was used to adjust the DNA to an equal 
amount in each transfection. The firefly luciferase activity was first normalized to 
15 the co-transfected Renilla luciferase activity and the value for MEF2C alone was 
then set as 1. Results are shown as the mean offeree independent transfections +/- 
standard deviation. 

FIG. 10 shows a schematic representation of the HDAC domains of human 
non-Sir2 family HDACs and HDRP. The boxes represent histone deacetylase 
20 (HDAC) domains. 

FIG. 1 1 is a schematic representation of the order in which FIGS. 1 1 A-l IF 
should be viewed. 

FIGS. 1 1 A-l IF show the nucleotide sequence of the vector pFLAG-CMV- 
5b-HDAC9 (VR1) (SEQ ID NO: 14). Lowercase letters are vector backbone, 
25 uppercase letters are HDAC9 sequence. "Acc" was added at the beginning of the 
HDAC9 sequence for translation initiation. 

FIG. 12 is a schematic representation of the order in which FIGS. 12-1 
through 12-66 should be viewed. 

FIGS. 12-1 through 12-66 show the nucleotide sequence of the vector 
30 pFLAG-CMV-5b-HDAC9a (VR2), with restriction enzyme sites indicated (SEQ ID 
NO: 14). 
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FIG. 13 is a schematic representation of the order in which FIGS. 13A-13E 
should be viewed. 

FIGS. 13A-13E show the nucleotide sequence of the vector pFLAG-CMV- 
5b-HDAC9a (VR2) (SEQ ID NO: 15). Lowercase letters are vector backbone, 
5 uppercase letters are HDAC9a sequence. "Acc" was added at the beginning of the 
HDAC9a sequence for translation initiation. 

FIG. 14 is a schematic representation of the order in which FIGS. 14-1 
through 14-61 should be viewed. 

FIGS. 14-1 through 14-61 show the nucleotide sequence of the vector 
10 pFLAG-CMV-5b-HDAC9a (VR2), with restriction enzyme sites indicated (SEQ ID 
NO: 15). 

DETAILED DESCRIPTION OF THE INVENTION 

A protein designated HDRP (See Zhou et aL, Proc. Natl. Acad. Sci. USA, 

15 97:1056-1061 (2000)) (also called MTTR (See Sparrow et al, EMBO J. 18:5085- 
5098(1999); Zhang et al, J. Biol. Chem., 276:35-39 (2001); and Zhang et al, Proc. 
Natl. Acad. Sci. USA, 98:7354-7359 (2001)) that is 50% identical to the N-terminal 
domains of histone deacetylase 4 (HDAC4) and histone deacetylase 5 (HDAC5) was 
recently identified. The cloning and characterization of a novel histone deacetylase, 

20 HDAC9, of which HDRP is an alternatively spliced isoform is described herein. The 
cDNA sequence of HDAC9 is shown in FIGS. 1A-1C (SEQ ID NO: 1), and the 
HDAC9 amino acid sequence is shown in FIG. 2A (SEQ ID NO: 2). In addition to 
cloning HDAC9, other alternatively spliced isoforms of HDAC9, designated as 
HDAC9a (a polypeptide that is 132 amino acids shorter at the C-terminal end than 

25 HDAC9), and isoforms of HDAC9, HDAC9a, and HDRP polypeptides that lack the 
nuclear localization signal (NLS) in the N-terminal non-catalytic end of HDAC9, 
termed HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS), respectively were 
also identified. The cDNA sequence of HDAC9a is shown in FIGS. 1D-1G (SEQ 
ID NO: 3), and the HDAC9a amino acid sequence is shown in FIG. 2B (SEQ ID 

30 NO: 4). The cDNA sequence of HDAC9 lacking amino acids encoding an NLS 

(HDA C9(ANLS)) is shown in FIGS. 1 ML (SEQ ID NO: 5), and the HDAC9 lacking 
an NLS amino acid sequence is shown in FIG. 2C (SEQ ID NO: 6). The cDNA 
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sequence of HDAC9a encoding a polypeptide lacking an NLS (HDAC9a(ANLS)) is 
shown in FIGS. 1M-10 (SEQ ID NO: 7), and the HDAC9a lacking an NLS amino 
acid sequence is shown in FIG. 2D (SEQ ID NO: 8). The cDNA sequence of HDRP 
encoding a polypeptide lacking an NLS (HDRP(ANLS)) is shown in FIGS. 1H-1I 
5 (SEQ ID NO: 9), and the HDRP lacking an NLS amino acid sequence is shown in 
FIG. 2E (SEQ ID NO: 10). 



POLYPEPTIDES OF THE INVENTION 

The present invention features isolated or recombinant HDAC9 polypeptides, 
10 HDAC9a polypeptides, HDAC9(ANLS) polypeptides, HDAC9a(ANLS) 

polypeptides, and HDRP(ANLS) polypeptides, and fragments, derivatives, and 
variants thereof, as well as polypeptides encoded by nucleotide sequences described 
herein (e.g., other variants). As used herein, the term "polypeptide" refers to a 
polymer of amino acids, and not to a specific length; thus, peptides, oligopeptides, 
1 5 and proteins are included within the definition of a polypeptide. 

As used herein, a polypeptide is said to be "isolated," "substantially pure," or 
"substantially pure and isolated" when it is substantially free of cellular material, 
when it is isolated from recombinant or non-recombinant cells, or free of chemical 
precursors or other chemicals when it is chemically synthesized. Typically, the 
20 HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 

polypeptide is isolated, substantially pure, or substantially pure and isolated when it 
has a relative increased concentration or activity of HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), in comparison to total HDAC 
concentration or activity. Preferably the increased activity or concentration of the 
25 HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is at least 
2-fold, more preferably, at least 5-fold, and most preferably, at least 10 fold, in 
comparison to total HDAC concentration or activity. In addition, a polypeptide can 
be joined to another polypeptide with which it is not normally associated in a cell 
{e.g., in a "fusion protein") and still be "isolated," "substantially pure," or 
30 "substantially pure and isolated." An isolated, substantially pure, or substantially 
pure and isolated polypeptide may be obtained, for example, using affinity 
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purification techniques described herein, as well as other techniques described herein 
and known to those skilled in the art. 

By a "histone deacetylase polypeptide" is meant a polypeptide having histone 
deacetylase activity, transcription repression activity, and/or the ability to deacetylate 
5 other substrates, for example, transcription factors, including p53, CoRest, E2F, 
GATA-1, TFHe, and TFIDF that normally have a nuclear or cytoplasmic location in a 
cell A histone deacetylase polypeptide is also a polypeptide whose activity can be 
inhibited by molecules having HDAC inhibitory activity. These molecules fall into 
four general classes: 1) short-chain fatty acids (e.g., 4-phenylbutyrate and valproic 

10 acid); 2) hydroxamic acids(e.g, SAHA, Pyroxamide, trichostatin A (TSA), 

oxamflatin and CHAPs, such as, CHAP1 and CHAP 31); 3) cyclic tetrapeptides 
(Trapoxin A, Apicidin and Depsipeptide (FK-228, also known as FR901 1228); 4) 
benzamides (e.g., MS-275); and other compounds such as Scriptaid. Examples of 
such compounds can be found in U.S. Patent Nos. 5,369,108, issued on November 

15 29, 1994, 5,700,81 1, issued on December 23, 1997, and 5,773,474, issued on June 
30, 1998 to Breslow et al, U.S. Patent Nos. 5,055,608, issued on October 8, 1991, 
and 5,175,191, issued on December 29, 1992 to Marks et al., as well as, Yoshida et 
a/., Bioessays 17, 423-430 (1995), Saito et al 9 PNAS USA 96, 4592-4597, (1999), 
Furamai et al , PNAS USA 98 (1), 87-92 (2001), Komatsu et al , Cancer Res. 

20 61(1 1), 4459-4466 (2001), Su et al 9 Cancer Res. 60, 3137-3 142 (2000), Lee et al, 
Cancer Res. 61(3), 931-934 and Suzuki et al J. Med. Chem. 42(15), 3001-3003 
(1999) the entire content of all of which are hereby incorporated by reference. 
Examples of such histone deacetylase polypeptides include HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), HDRP(ANLS); a substantially pure polypeptide 

25 comprising SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ 
ID NO: 10; and a polypeptide having preferably at least 60%, more preferably, 70%, 
75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to any one of 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, 
as determined using the BLAST program and parameters described herein. 

30 In one embodiment, the histone deacetylase polypeptide has histone 

deacetylase activity, transcription repression activity, the ability to deacetylate 
substrates, or is inhibited by trichostatin A or a hybrid polar compound such as 
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SAHA. In another embodiment, the HDAC9(ANLS) polypeptide has any two of the 
above biological activities. In still another embodiment, the HDAC9(ANLS) 
polypeptide has any three of the above biological activities. In yet another 
embodiment, the HDAC9(ANLS) polypeptide has all of the above biological 
5 activities. 

An HDAC9 polypeptide is a histone deacetylase polypeptide as described 
above. An HDAC9 polypeptide preferably has at least 60%, more preferably, 70%, 
75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to SEQ ID 
NO: 2, as determined using the BLAST program and parameters described herein. 

10 An HDAC9 polypeptide is also a polypeptide that comprises the amino acids 

encoded by exons 23, 24, 25 and/or 26, and that does not comprise the amino acids 
encoded by exon 13 of the HDAC9 nucleic acid sequence, as shown in FIGS. 1 A- 
1 C, FIG. 4, and FIGS. 5A-5D. Preferably, an HDAC9 polypeptide comprises the 
sequence of SEQ ID NO: 2. More preferably, an HDAC9 polypeptide consists of 

15 the sequence of SEQ ID NO: 2. An HDAC polypeptide is also a polypeptide 

comprising the amino acid sequence of the polypeptide encoded by the nucleic acid 
sequence of SEQ ID NO: 1. 

An HDAC9a polypeptide is a histone deacetylase polypeptide as described 
above. An HDAC9a polypeptide preferably has at least 60%, more preferably, 70%, 

20 75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to SEQ ID 
NO: 4, as determined using the BLAST program and parameters described herein. 
An HDAC9a polypeptide is also a polypeptide that comprises the amino acids 
encoded by exon 22, and that does not comprise the amino acids encoded by exons 
13, 23, 24, 25, or 26 of the HDAC9 nucleic acid sequence, as shown in FIGS. 1D- 

25 lG,FIG.4,andFIGS.5A-5D. Preferably, an HDAC9a polypeptide comprises the 
sequence of SEQ ID NO: 4. More preferably, an HDAC9a polypeptide consists of 
the sequence of SEQ ID NO: 4. An HDAC9a polypeptide is also a polypeptide 
comprising the amino acid sequence of the polypeptide encoded by the nucleic acid 
sequence of SEQ ID NO: 3. 
30 An HDAC9(ANLS) is a histone deacetylase polypeptide as described above. 

An HDAC9(ANLS) polypeptide does not comprise a nuclear localization signal 
(NLS). An HDAC9(ANLS) polypeptide preferably has at least 60%, more 
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preferably, 70%, 75%, 80%, 85%, or 90%, and most preferably, 95% sequence 
identity to SEQ ID NO: 6, as determined using the BLAST program and parameters 
described herein. An HDAC9(ANLS) polypeptide is also a polypeptide that 
comprises the amino acids encoded by exons 23, 24, 25, and/or 26, and that does not 
5 comprise the amino acids encoded by exons 7 or 1 3 of the HDAC9 nucleic acid 
sequence, as shown in FIGS. 1J-1L, and FIGS. 5A-5D. Preferably, an 
HDAC9(ANLS) polypeptide comprises the sequence of SEQ ID NO: 6. More 
preferably, an HDAC9(ANLS) polypeptide consists of the sequence of SEQ ID NO: 
6. An HDAC9(ANLS) polypeptide is also a polypeptide comprising the amino acid 
10 sequence of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO: 5. 
An HDAC9a(ANLS) polypeptide is a histone deacetylase polypeptide as 
described above. An HDAC9a(ANLS) does not comprise a nuclear localization 
signal (NLS). An HDAC9a(ANLS) polypeptide preferably has at least 60%, more 
preferably, 70%, 75%, 80%, 85%, or 90%, and most preferably, 95% sequence 
15 identity to SEQ ID NO: 8, as determined using the BLAST program and parameters 
described herein. An HDAC9a(ANLS) polypeptide is also a polypeptide that 
comprises the amino acids encoded by exon 22, and that does not comprise the 
amino acids encoded by exons 7, 13, 23, 24, 25, or 26 of the HDAC9 nucleic acid 
sequence, as shown in FIGS. 1M-10, and FIGS. 5A-5D. Preferably, an 
20 HDAC9a(ANLS) polypeptide comprises the sequence of SEQ ID NO: 8. More 

preferably, an HDAC9a(ANLS) polypeptide consists of the sequence of SEQ ID NO: 
8. An HDAC9a(ANLS) polypeptide is also a polypeptide comprising the amino acid 
sequence of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO: 7. 
An HDRP(ANLS) polypeptide is a histone deacetylase polypeptide as 
25 described above. An HDRP(ANLS) does not comprise a nuclear localization signal 
(NLS). An HDRP(ANLS) polypeptide preferably has at least 60%, more preferably, 
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to SEQ 
ID NO: 10, as determined using the BLAST program and parameters described 
herein. An HDRP(ANLS) polypeptide is also a polypeptide that does not comprise 
30 the amino acids encoded by exons 7 or 13-26 of the HDAC9 nucleic acid sequence, 
as shown in FIGS. 1H-1I and FIGS. 5A-5D. Preferably, an HDRP(ANLS) 
polypeptide comprises the sequence of SEQ ID NO: 10. More preferably, an 
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HDRP(ANLS) polypeptide consists of the sequence of SEQ ID NO: 10. An 
HDRP(ANLS) polypeptide is also a polypeptide comprising the amino acid sequence 
of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO: 9. 

The polypeptides of the invention can be purified to homogeneity. It is 
5 understood, however, that preparations in which the polypeptide is not purified to 
homogeneity are useful. The critical feature is that the preparation allows for the 
desired function of the polypeptide, even in the presence of considerable amounts of 
other components. Thus, the invention encompasses various degrees of purity. In 
one embodiment, the language "substantially free of cellular materiar includes 
10 preparations of the polypeptide having less than about 30% (by dry weight) other 
proteins {i.e., contaminating protein), less than about 20% other proteins, less than 
about 10% other proteins, or less than about 5% other proteins. 

When a polypeptide is recombinantly produced, it can also be substantially 
free of culture medium, i.e., culture medium represents less than about 20%, less 
15 than about 10%, or less than about 5% of the volume of the polypeptide preparation. 
The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the polypeptide in which it is separated from chemical precursors or 
other chemicals that are involved in its synthesis. In one embodiment, the language 
"substantially free of chemical precursors or other chemicals" includes preparations 
20 of the polypeptide having less than about 30% (by dry weight) chemical precursors 
or other chemicals, less than about 20% chemical precursors or other chemicals, less 
than about 10% chemical precursors or other chemicals, or less than about 5% 
chemical precursors or other chemicals. 

In one embodiment, a polypeptide of the invention comprises an amino acid 
25 sequence encoded by a nucleic acid molecule comprising a nucleotide sequence 

selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, 
SEQ ID NO: 7, SEQ ID NO: 9, and complements and portions thereof, (e.g., a 
complement of any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ED 
NO: 7, SEQ ID NO: 9 or a portion of any one of SEQ ID NO: 1 or SEQ ID NO: 3, 
30 SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9). 

The polypeptides of the invention also encompass fragments and sequence 
variants. Variants include a substantially homologous polypeptide encoded by the 
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same genetic locus in an organism, le. , an allelic variant, as well as other variants. 
Variants also encompass polypeptides derived from other genetic loci in an 
organism, but having substantial homology to a polypeptide encoded by a nucleic 
acid molecule comprising a nucleotide sequence selected from the group consisting 
5 of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, 
and complements and portions thereof, or having substantial homology to a 
polypeptide encoded by a nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of nucleotide sequences encoding any one of SEQ 
ID NO: 2,* SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10. 
10 Variants also include polypeptides substantially homologous or identical to these 
polypeptides but derived from another organism, z.e., an ortholog. Variants also 
include polypeptides that are substantially homologous or identical to these 
polypeptides that are produced by chemical synthesis. Variants also include 
polypeptides that are substantially homologous or identical to these polypeptides that 
1 5 are produced by recombinant methods. 

As used herein, two polypeptides (or a region of the polypeptides) are 
substantially homologous or identical when the amino acid sequences are at least 
about 60-65%, typically at least about 70-75%, more typically at least about 80-85%, 
and most typically greater than about 90-95% or more homologous or identical. A 
20 substantially identical or homologous amino acid sequence, according to the present 
invention, will be encoded by a nucleic acid molecule hybridizing to SEQ ID NO: 1, 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, or a portion thereof, 
under stringent conditions as more particularly described herein, or will be encoded 
by a nucleic acid molecule hybridizing to a nucleic acid sequence encoding SEQ ID 
25 NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, or portion 
thereof, under stringent conditions as more particularly described herein. 

The percent identity of two nucleotide or amino acid sequences can be 
determined by aligning the sequences for optimal comparison purposes (e.£., gaps 
can be introduced in the sequence of a first sequence). The nucleotides or amino 
30 acids at corresponding positions are then compared, and the percent identity between 
the two sequences is a function of the number of identical positions shared by the 
sequences (Le.,% identity = # of identical positions/total # of positions x 100). In 
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certain embodiments, the length of the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), and HDRP(ANLS) amino acid or nucleotide sequence aligned for 
comparison purposes is at least 30% preferably, at least 40%, more preferably, at 
least 60%, and even more preferably, at least 70%, 80%, 90%, or 100% of the length 
5 of the reference sequence, for example, those sequences provided in FIGS . 1 A-l O 
and 2A-2E. The actual comparison of the two sequences can be accomplished by 
well-known methods, for example, using a mathematical algorithm. A preferred, 
non-limiting example of such a mathematical algorithm is described in Karlin et ah, 
Proc. Natl. Acad. Sci. USA, 90:5873-5877 (1993). Such an algorithm is 
10 incorporated into the BLASTN and BLASTX programs (version 2.2) as described in 
Schaffer et al y Nucleic Acids Res., 29:2994-3005 (2001). When utilizing BLAST 
and Gapped BLAST programs, the default parameters of the respective programs 
(e.g., BLASTN) can be used. See http://www.ncbi.nlm.nih.gov, as available on 
August 10, 2001. In one embodiment, the database searched is a non-redundant 
15 (NR) database, and parameters for sequence comparison can be set at: no filters; 
Expect value of 10; Word Size of 3; the Matrix is BLOSUM62; and Gap Costs have 
an Existence of 1 1 and an Extension of 1. 

Another preferred, non-limiting example of a mathematical algorithm 
utilized for the comparison of sequences is the algorithm of Myers and Miller, 
20 CABIOS (1 989). Such an algorithm is incorporated into the ALIGN program 
(version 2.0), which is part of the GCG (Accelrys) sequence alignment software 
package. When utilizing the ALIGN program for comparing amino acid sequences, 
a PAM120 weight residue table, a gap length penalty of 12 , and a gap penalty of 4 
can be used. Additional algorithms for sequence analysis are known in the art and 
25 include ADVANCE and ADAM as described in Torellis and Robotti, Comput. 
Appl. Biosci., 10: 3-5 (1994); and FASTA described in Pearson and Lipman, Proc. 
Natl. Acad. Sci USA, 85: 2444-8 (1988). 

In another embodiment, the percent identity between two amino acid 
sequences can be accomplished using the GAP program in the GCG software 
30 package (available at http://www.acceliys.com, as available on August 3 1 , 2001) 
using either a Blossom 63 matrix or a PAM250 matrix, and a gap weight of 12, 10, 
8, 6, or 4 and a length weight of 2, 3, or 4. In yet another embodiment, the percent 
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identity between two nucleic acid sequences can be accomplished using the GAP 
program in the GCG software package (available at http://www.cgc.com), using a 
gap weight of 50 and a length weight of 3. 

The invention also encompasses HDAC9, HDAC9a, HDAC9(ANLS), 
5 HDAC9aANLS, and HDRP(ANLS) polypeptides having a lower degree of identity 
but having sufficient similarity so as to perform one or more of the same functions 
performed by an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9aANLS, or 
HDRP(ANLS) polypeptide encoded by a nucleic acid molecule of the invention. 
Similarity is determined by conserved amino acid substitution. Such substitutions 

10 are those that substitute a given amino acid in a polypeptide by another amino acid 
of like characteristics. Conservative substitutions are likely to be phenotypically 
silent. Typically seen as conservative substitutions are the replacements, one for 
another, among the aliphatic amino acids Ala, Val, Leu, and He; interchange of the 
hydroxyl residues Ser and Thr; exchange of the acidic residues Asp and Glu; 

1 5 substitution between the amide residues Asn and Gin; exchange of the basic residues 
Lys and Arg; and replacements among the aromatic residues Phe and Tyr. Guidance 
concerning which amino acid changes are likely to be phenotypically silent are found 
in Bowie et ah, Science 247: 1306-1310 (1990). 

A variant polypeptide can differ in amino acid sequence by one or more 

20 substitutions, deletions, insertions, inversions, fusions, and truncations or a 

combination of any of these. Further, variant polypeptides can be fully functional or 
can lack function in one or more activities, for example, in histone deacetylase 
activity or transcription repression activity. Fully functional variants typically 
contain only conservative variation or variation in non-critical residues or in 

25 non-critical regions. Functional variants can also contain substitution of similar 
amino acids that result in no change or an insignificant change in function. 
Alternatively, such substitutions may positively or negatively affect function to some 
degree. Non-functional variants typically contain one or more non-conservative 
amino acid substitutions, deletions, insertions, inversions, or truncations or a 

30 substitution, insertion, inversion, or deletion in a critical residue or critical region, 
such critical regions include the HDAC domains, which provide the polypeptide 
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with deacetylase activity, as shown in the nucleic acid sequences ofFIGS. 1A-1G, as 
well as in the schematic of FIG. 4. 

Amino acids that are essential for function can be identified by methods 
known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis 
5 (Cunningham et al, Science, 244: 1081-1085 (1989)). The latter procedure 
introduces a single alanine mutation at each of the residues in the molecule (one 
mutation per molecule). The resulting mutant molecules are then tested for 
biological activity in vitro. Sites that are critical for polypeptide activity can also be 
determined by structural analysis, such as crystallization, nuclear magnetic 
10 resonance, or photoaffinity labeling (See Smith et al., J. Mol. Biol., 224: 899-904 
(1992); and de Vos etal. Science, 255: 306-312 (1992)). 

The invention also includes HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), and HDRP(ANLS) polypeptide fragments of the polypeptides of 
the invention. Fragments can be derived from a polypeptide comprising SEQ ID 
15 NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, or from 
a polypeptide encoded by a nucleic acid molecule comprising SEQ ID NO: 1, SEQ 
ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9 or a portion thereof and 
the complements thereof or other variants. The present invention also encompasses 
fragments of the variants of the polypeptides described herein. Useful fragments 
20 include those that retain one or more of the biological activities of the polypeptide as 
well as fragments that can be used as an immunogen to generate polypeptide-specific 
antibodies. 

Biologically active fragments (peptides that are, for example, 6, 9, 12, 15, 16, 
20,30,35,36,37,38,39,40,50, 100, or more amino acids in length) can comprise ' 

25 a domain, segment, or motif, for example, an HDAC domain, that has been 

identified by analysis of the polypeptide sequence using well-known methods, e.g., 
signal peptides, extracellular domains, one or more transmembrane segments or 
loops, ligand binding regions, zinc finger domains, DNA binding domains, acylation 
sites, glycosylate sites, or phosphorylation sites. 

30 Fragments can be discrete (not fused to other amino acids or polypeptides) or 

can be within a larger polypeptide. Further, several fragments can be comprised 
within a single larger polypeptide. In one embodiment a fragment designed for 
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expression in a host can have heterologous pre- and pro-polypeptide regions fused to 
the amino terminus of the polypeptide fragment and an additional region fused to the 
carboxyl terminus of the fragment. 

The invention thus provides chimeric or fusion polypeptides. These 
5 comprise an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9aANLS, or HDRP(ANLS) 
polypeptide of the invention operatively linked to a heterologous protein or 
polypeptide having an amino acid sequence not substantially homologous to the 
polypeptide. "Operatively linked" indicates that the polypeptide and the 
heterologous protein are fused in-frame. The heterologous protein can be fused to 

10 the N-terminus or C-terminus of the polypeptide. In one embodiment, the fusion 
polypeptide does not affect the function of the polypeptide per se. For example, the 
fusion polypeptide can be a GST-fusion polypeptide in which the polypeptide 
sequences are fused to the C-terminus of the GST sequences. Other types of fusion 
polypeptides include, but are not limited to, enzymatic fusion polypeptides, for 

15 example, p-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions, 
and Ig fusions. Such fusion polypeptides, particularly poly-His fusions, can 
facilitate the purification of recombinant polypeptide. In certain host cells (e.g., 
mammalian host cells), expression and/or secretion of a polypeptide can be 
increased by using a heterologous signal sequence. Therefore, in another 

20 embodiment, the fusion polypeptide contains a heterologous signal sequence at its 
N-terminus. 

EP-A 0464 533 discloses fusion proteins comprising various portions of 
immunoglobulin constant regions. The Fc is useful in therapy and diagnosis and 
thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262). 

25 In drug discovery, for example, human proteins have been fused with Fc portions for 
the purpose of high-throughput screening assays to identify antagonists. (See 
Bennett et al. 9 Journal of Molecular Recognition, 8: 52-58 (1995) and Johanson et 
al 9 The Journal of Biological Chemistry, 270,16: 9459-9471 (1995)). Thus, this 
invention also encompasses soluble fusion polypeptides containing a polypeptide of 

30 the invention and various portions of the constant regions of heavy or light chains of 
immunoglobulins of various subclass (IgG, IgM, IgA, IgE). 
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A chimeric or fusion polypeptide can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide 
sequences are ligated together in-frame in accordance with conventional techniques. 
In another embodiment, the fusion gene can be synthesized by conventional 
5 techniques including automated DNA synthesizers. Alternatively PCR 

amplification of nucleic acid fragments can be carried out using anchor primers that 
give rise to complementary overhangs between two consecutive nucleic acid 
fragments that can subsequently be annealed and re-amplified to generate a chimeric 
nucleic acid sequence (see Ausubel et ah, "Current Protocols in Molecular Biology," 
10 John Wiley & Sons, (1998), the entire teachings of which are incorporated by 
reference herein). Moreover, many expression vectors are commercially available 
that already encode a fusion moiety (e.g., a GST protein). A nucleic acid molecule 
encoding a polypeptide of the invention can be cloned into such an expression vector 
such that the fusion moiety is linked in-frame to the polypeptide. 
1 5 The substantially pure, isolated, or substantially pure and isolated HDAC9, 

HDAC9a, HDAC9(ANLS), HDAC9aANLS, or HDRP(ANLS) polypeptide can be 
purified from cells that naturally express it, purified from cells that have been altered 
to express it (recombinant), or synthesized using known protein synthesis methods. 
In one embodiment, the polypeptide is produced by recombinant DNA techniques. 
20 For example, a nucleic acid molecule encoding the polypeptide is cloned into an 
expression vector, the expression vector introduced into a host cell, and the 
polypeptide expressed in the host cell. The polypeptide can then be isolated from 
the cells by an appropriate purification scheme using standard protein purification 
techniques. 

25 In general, HDAC9, HDAC9a, HDAC9(ANLS), HDAC9aANLS, and 

HDRP(ANLS) polypeptides of the present invention can be used as a molecular 
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns 
using art-recognized methods. The polypeptides of the present invention can be 
used to raise antibodies or to elicit an immune response. The polypeptides can also 

30 be used as a reagent, e.g., a labeled reagent, in assays to quantitatively determine 
levels of the polypeptide or a molecule to which it binds (e.g., a receptor or a ligand) 
in biological fluids. The polypeptides can also be used as markers for cells or tissues 
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in which the corresponding polypeptide is preferentially expressed, either 
constitutively, during tissue differentiation, or in a diseased state. The polypeptides 
can be used to isolate a corresponding binding agent, and to screen for peptide or 
small molecule antagonists or agonists of the binding interaction. The polypeptides 
5 of the present invention can also be used as therapeutic agents. 

NUCLEIC ACID MOLECULES OF THE INVENTION 

The present invention also features isolated HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules. 

10 By a "histone deacetylase nucleic acid molecule" is meant a nucleic acid 

molecule that encodes a histone deacetylase polypeptide. Such histone nucleic acids 
include, for example, the HDAC9, HDAC9a 9 HDA C9(ANLS), HDA C9a(ANLS), or 
HDRP(ANLS) nucleic acid molecule described in detail herein; an isolated nucleic 
acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or 

1 5 SEQ ID NO: 9; a complement of an isolated nucleic acid comprising SEQ ID NO: 1 , 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; an isolated 
nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; a complement of an 
isolated nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO: 2, 

20 SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; a nucleic acid 
that is hybridizeable under high stringency conditions to a nucleic acid molecule that 
encodes any of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or 
a complement thereof; a nucleic acid molecule that is hybridizeable under high 
stringency conditions to a nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3, 

25 SEQ ID NO: 5, or SEQ ID NO: 7; and an isolated nucleic acid molecule that has at 
least 55%, more preferably, 60%, 65%, 70%, 75%, 80%, 85%, or 90%, and most 
preferably, 95% or 99% sequence identity with any one of SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, or a complement thereof. 

An HDAC9 nucleic acid molecule is a nucleic acid molecule that encodes an 

30 HDAC9 polypeptide. In one embodiment, the HDAC9 nucleic acid molecule is 
selected from: a nucleic acid molecule that comprises the nucleic acid sequence of 
SEQ ID NO: 1; a complement of an isolated nucleic acid comprising SEQ ID NO: 1; 
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an isolated nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO: 
2; a complement of an isolated nucleic acid encoding a histone deacetylase 
polypeptide of SEQ ID NO: 2; a nucleic acid that is hybridizeable under high 
stringency conditions to a nucleic acid molecule that encodes SEQ ID NO: 2; a 
5 nucleic acid molecule that is hybridizeable under high stringency conditions to a 
nucleic acid comprising SEQ ID NO: 1 ; and an isolated nucleic acid molecule that 
has preferably, at least 55%, more preferably, 60%, 65%, 70%, 75%, 80%, 85%, or 
90%, and most preferably, 95% or 99% sequence identity with SEQ ID NO: 1, as 
determined using the BLAST program and parameters described herein. In another 
10 embodiment, the HDAC9 nucleic acid molecule consists of the nucleic acid 
sequence of SEQ ID NO: 1 . 

An HDAC9a nucleic acid molecule is a nucleic acid molecule that encodes 
an HDAC9a polypeptide. An HDAC9a nucleic acid molecule preferably has at least 
55%, sequence identity to SEQ ID NO: 3, In one embodiment, the HDAC9a nucleic 
15 acid molecule is selected from: a nucleic acid molecule that comprises the nucleic 
acid sequence of SEQ ID NO: 3; a complement of an isolated nucleic acid 
comprising SEQ ID NO: 3; an isolated nucleic acid encoding a histone deacetylase 
polypeptide of SEQ ID NO: 4; a complement of an isolated nucleic acid encoding a 
histone deacetylase polypeptide of SEQ ID NO: 4; a nucleic acid that is 
20 hybridizeable under high stringency conditions to a nucleic acid molecule that 
encodes SEQ ID NO: 4; a nucleic acid molecule that is hybridizeable under high 
stringency conditions to a nucleic acid comprising SEQ ID NO: 3; and an isolated 
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%, 
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity 
25 with SEQ ID NO: 3 or a complement thereof, as determined using the BLAST 
program and parameters described herein. In another embodiment, the HDAC9a 
nucleic acid molecule consists of the nucleic acid sequence of SEQ ID NO: 3. 

An HDAC9(ANLS) nucleic acid molecule is a nucleic acid molecule that 
encodes an HDAC9(ANLS) polypeptide. In one embodiment, the HDAC9(ANLS) 
30 nucleic acid molecule is selected from: a nucleic acid molecule that comprises the 
nucleic acid sequence of SEQ ID NO: 5; a complement of an isolated nucleic acid 
comprising SEQ ID NO: 5; an isolated nucleic acid encoding a histone deacetylase 
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polypeptide of SEQ ID NO: 6; a complement of an isolated nucleic acid encoding a 
histone deacetylase polypeptide of SEQ ID NO: 6; a nucleic acid that is 
hybridizeable under high stringency conditions to a nucleic acid molecule that 
encodes SEQ ID NO: 6; a nucleic acid molecule that is hybridizeable under high 
5 stringency conditions to a nucleic acid comprising SEQ ID NO: 5; and an isolated 
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%, 
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity 
with SEQ ID NO: 5 or a complement thereof, as determined using the BLAST 
program and parameters described herein. In another embodiment, the 
10 HDAC9(ANLS) nucleic acid molecule consists of the nucleic acid sequence of SEQ 
ID NO: 5. 

An HDAC9a(ANLS) nucleic acid molecule is a nucleic acid molecule that 
encodes an HDAC9a(ANLS) polypeptide. In one embodiment, the HDAC9a(ANLS) 
nucleic acid molecule is selected from: a nucleic acid molecule that comprises the 

1 5 nucleic acid sequence of SEQ ID NO: 7; a complement of an isolated nucleic acid 
comprising SEQ ID NO: 7; an isolated nucleic acid encoding a histone deacetylase 
polypeptide of SEQ ID NO: 8; a complement of an isolated nucleic acid encoding a 
histone deacetylase polypeptide of SEQ ID NO: 8; a nucleic acid that is 
hybridizeable under high stringency conditions to a nucleic acid molecule that 

20 encodes SEQ ID NO: 8; a nucleic acid molecule that is hybridizeable under high 
stringency conditions to a nucleic acid comprising SEQ ID NO: 7; and an isolated 
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%, 
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity 
with SEQ ID NO: 7 or a complement thereof, as determined using the BLAST 

25 program and parameters described herein. In another embodiment, the 

HDAC9a(ANLS) nucleic acid molecule consists of the nucleic acid sequence of SEQ 
ID NO: 7. 

An "HDRP(ANLS) nucleic acid molecule" is a nucleic acid molecule that 
encodes an HDRP(ANLS) polypeptide. In one embodiment, the HDRP(ANLS) 
30 nucleic acid molecule is selected from: a nucleic acid molecule that comprises the 
nucleic acid sequence of SEQ ID NO: 9; a complement of an isolated nucleic acid 
comprising SEQ ID NO: 9; an isolated nucleic acid encoding a histone deacetylase 
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polypeptide of SEQ ID NO: 10; a complement of an isolated nucleic acid encoding a 
histone deacetylase polypeptide of SEQ ID NO: 10; and an isolated nucleic acid 
molecule that has preferably, at least 55%, more preferably, 60%, 65%, 70%, 75%, 
80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity with SEQ 
5 ID NO: 9 or a complement thereof, as determined using the BLAST program and 
parameters described herein. . In another embodiment, the HDRP(ANLS) nucleic 
acid molecule consists of the nucleic acid sequence of SEQ ID NO: 9. 

The isolated nucleic acid molecules of the present invention can be RNA, for 
example, mRNA, or DNA, such as cDNA and genomic DNA DNA molecules can 
10 be double-stranded or single-stranded; single stranded RNA or DNA can be either 
the coding, or sense, strand or the non-coding, or antisense, strand. The nucleic acid 
molecule can include all or a portion of the coding sequence of the gene and can 
further comprise additional non-coding sequences such as introns and non-coding 3" 
and 5' sequences (including regulatory sequences, for example). Additionally, the 
1 5 nucleic acid molecule can be fused to a marker sequence, for example, a sequence 
that encodes a polypeptide to assist in isolation or purification of the polypeptide. 
Such sequences include, but are not limited to, those that encode a 
glutathione-S-transferase (GST) fusion protein and those that encode a 
hemagglutinin A (HA) polypeptide marker from influenza. 
20 An "isolated," "substantially pure," or "substantially pure and isolated" 

nucleic acid molecule, as used herein, is one that is separated from nucleic acids that 
normally flank the gene or nucleotide sequence (as in genomic sequences) and/or has 
been completely or partially purified from other transcribed sequences (e.g., as in an 
RNA or cDNA library). For example, an isolated nucleic acid of the invention may 
25 be substantially isolated with respect to the complex cellular milieu in which it 
naturally occurs, or culture medium when produced by recombinant techniques, or 
chemical precursors or other chemicals when chemically synthesized In some 
instances, the isolated material will form part of a composition (for example, a crude 
extract containing other substances), buffer system, or reagent mix. In other 
30 circumstances, the material may be purified to essential homogeneity, for example, 
as determined by agarose gel electrophoresis or column chromatography such \ 



i as 
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HPLC. Preferably, an isolated nucleic acid molecule comprises at least about 50, 80, 
or 90% (on a molar basis) of all macromolecular species present. 

With regard to genomic DNA, the term "isolated" also can refer to nucleic 
acid molecules that are separated from the chromosome with which the genomic 
5 DNA is naturally associated. For example, the isolated nucleic acid molecule can 
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotides 
that flank the nucleic acid molecule in the genomic DNA of the cell from which the 
nucleic acid molecule is derived. 

The HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) 

10 nucleic acid molecule can be fused to other coding or regulatory sequences and still 
be considered isolated. Thus, recombinant DNA contained in a vector is included in 
the definition of "isolated" as used herein. Also, isolated nucleic acid molecules 
include recombinant DNA molecules in heterologous host cells, as well as partially 
or substantially purified DNA molecules in solution. "Isolated" nucleic acid 

15 molecules also encompass in vivo and in vitro RNA transcripts of the DNA 

molecules of the present invention. An isolated nucleic acid molecule or nucleotide 
sequence can include a nucleic acid molecule or nucleotide sequence that is 
synthesized chemically or by recombinant means. Therefore, recombinant DNA 
contained in a vector are included in the definition of "isolated" as used herein. 

20 Isolated nucleotide molecules also include recombinant DNA molecules in 

heterologous organisms, as well as partially or substantially purified DNA molecules 
in solution. In vivo and in vitro RNA transcripts of the DNA molecules of the 
present invention are also encompassed by "isolated" nucleotide sequences. Such 
isolated nucleotide sequences are useful in the manufacture of the encoded 

25 polypeptide, as probes for isolating homologous sequences (e.g., from other 
mammalian species), for gene mapping (e.g., by in situ hybridization with 
chromosomes), or for detecting expression of the gene in tissue (e.g., human tissue), 
such as by Northern blot analysis. 

The present invention also pertains to variant HDAC9, HDAC9a, 

30 HDAC9(ANLS) 9 HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules that are 
not necessarily found in nature but that encode an HDAC9, HD AC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide. Thus, for 
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example, DNA molecules that comprise a sequence that is different from the 
nahirally-occurring HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) nucleotide sequence but which, due to the degeneracy of the genetic 
code, encode an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
5 HDRP(ANLS) polypeptide of the present invention are also the subject of this 
invention. 

The invention also encompasses HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), and HDRP(ANLS) nucleotide sequences encoding portions 
(fragments), or encoding variant polypeptides such as analogues or derivatives of an 
10 HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 

polypeptide. Such variants can be naturally-occurring, such as in the case of allelic 
variation or single nucleotide polymorphisms, or non-naturally-occurring, such as 
those induced by various mutagens and mutagenic processes. Intended variations 
include, but are not limited to, addition, deletion, and substitution of one or more 
1 5 nucleotides that can result in conservative or non-conservative amino acid changes, 
including additions and deletions. Preferably, theHDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleotide (and/or resultant 
amino acid) changes are silent or conserved; that is, they do not alter the 
characteristics or activity of the HDAC9, HDAC9a, HDAC9(ANLS), 
20 HDAC9a(ANLS), or HDRP(ANLS) polypeptide. In one preferred embodiment, the 
nucleotide sequences are fragments that comprise one or more polymorphic 
microsatellite markers. 

Other alterations of the HDAC9, HDAC9a, HDAC9(ANLS), 
HDA C9a(ANLS), or HDRP(ANLS) nucleic acid molecules of the invention can 
25 include, for example, labeling, methylation, internucleotide modifications such as 
uncharged linkages {e.g., methyl phosphonates, phosphotriesters, phosphoamidates, 
and carbamates), charged linkages (e.g., phosphorothioates or phosphorodithioates), 
pendent moieties (e.g., polypeptides), intercalators (e.g., acridine or psoralen), 
chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids). 
30 Also included are synthetic molecules that mimic nucleic acid molecules in the 
ability to bind to a designated sequences via hydrogen bonding and other chemical 
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interactions. Such molecules include, for example, those in which peptide linkages 
substitute for phosphate linkages in the backbone of the molecule. 

The invention also pertains to HDAC9, HDAC9a, HDAC9(ANLS), 
HDA C9a ( ANLS) and HDRP(AMLS) nucleic acid molecules that hybridize under 
5 high stringency hybridization conditions, such as for selective hybridization, to a 
nucleotide sequence described herein (e.g., nucleic acid molecules that specifically 
hybridize to a nucleotide sequence encoding polypeptides described herein, and, 
optionally, have an activity of the polypeptide). In one embodiment, the invention 
includes variants described herein that hybridize under high stringency hybridization 
10 conditions (e.g., for selective hybridization) to a nucleotide sequence comprising a 
nucleotide sequence selected from SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, 
SEQ ID NO: 7, SEQ ID NO: 9 and the complement of SEQ ID NO: 1, SEQ ID NO: 
3, SEQ ID NO: 5, SEQ D NO: 7, or SEQ ID NO: 9. In another embodiment, the 
invention includes variants described herein that hybridize under high stringency 

15 hybridization conditions (e.g., for selective hybridization) to a nucleotide sequence 
encoding an amino acid sequence of SEQ ID NO: 2 (HDAC9), SEQ ID NO: 4 
(HDAC9a), SEQ ID NO: 6 (HDAC9(ANLS)), SEQ ID NO: 8 (HDAC9a(ANLS)), or 
SEQ ID NO: 10 (HDRP(ANLS)). In a preferred embodiment, the variant that 
hybridizes under high stringency hybridizations encodes a polypeptide that has a 

20 biological activity of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide (e.g., histone deacetylase activity or transcription 
repression activity). 

Such nucleic acid molecules can be detected and/or isolated by specific 
hybridization (e.g., under high stringency conditions). "Specific hybridization/ 5 as 

25 used herein, refers to the ability of a first nucleic acid to hybridize to a second 
nucleic acid in a manner such that the first nucleic acid does not hybridize to any 
nucleic acid other than to the second nucleic acid (e.g., when the first nucleic acid 
has a higher similarity to the second nucleic acid than to any other nucleic acid in a 
sample wherein the hybridization is to be performed). "Stringency conditions" for 

30 hybridization is a term of art that refers to the incubation and wash conditions, e.g., 
conditions of temperature and buffer concentration, that permit hybridization of a 
particular nucleic acid to a second nucleic acid; the first nucleic acid may be 
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perfectly (i.e., 100%) complementary to the second, or the first and second may 
share some degree of complementarity that is less than perfect (e.g., 70%, 75%, 
85%, 95%). For example, certain high stringency conditions can be used that 
distinguish perfectly complementary nucleic acids fiom those of less 
5 complementarity. "High stringency conditions," "moderate stringency conditions," 
and "low stringency conditions" for nucleic acid hybridizations are explained on 
pages 2.10.1-2.10.16 and pages 6.3.1-6.3.6 in Current Protocols in Molecular 
Biology (See Ausubel et al., supra, the entire teachings of which are incorporated by 
reference herein). The exact conditions that determine the stringency of 
10 hybridization depend not only on ionic strength (e.g., 0.2XSSC or 0.1XSSC), 
temperature (e.g., room temperature, 42°C or 68°C), and the concentration of 
destabilizing agents such as fonnamide or denaturing agents such as SDS, but also 
on factors such as the length of the nucleic acid sequence, base composition, percent 
mismatch between hybridizing sequences, and the frequency of occurrence of 
1 5 subsets of that sequence within other non-identical sequences. Thus, equivalent 
conditions can be determined by varying one or more of these parameters while 
maintaining a similar degree of identity or similarity between the two nucleic acid 
molecules. Typically, conditions are used such that sequences at least about 60%, at 
least about 70%, at least about 80%, at least about 90% or at least about 95% or 
20 more identical to each other remain hybridized to one another. By varying 

hybridization conditions from a level of stringency at which no hybridization occurs 
to a level at which hybridization is first observed, conditions that will allow a given 
sequence to hybridize (e.g., selectively) with the most similar sequences in the 
sample can be determined. 
25 Exemplary conditions are described in Krause and Aaronson, Methods in 

Enzymology, 200:546-556 (1991). Also, in, Ausubel, et al, supra, which describes 
the determination of washing conditions for moderate or low stringency conditions. 
Washing is the step in which conditions are usually set so as to determine a 
minimum level of complementarity of the hybrids. Generally, starting from the 
30 lowest temperature at which only homologous hybridization occurs, each °C by 
which the final wash temperature is reduced (holding SSC concentration constant) 
allows an increase by 1 % in the maximum extent of mismatching among the 
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sequences that hybridize. Generally, doubling the concentration of SSC results in an 
increase in Tm of 17°C. Using these guidelines, the washing temperature can be 
determined empirically for high, moderate, or low stringency, depending on the level 
of mismatch sought. 

5 For example, a low stringency wash can comprise washing in a solution 

containing 0.2XSSC/0.1% SDS for 10 minutes at room temperature; a moderate 
stringency wash can comprise washing in a prewarmed solution (42°C) solution 
containing 0.2XSSC/0.1% SDS for 15 minutes at 42°C; and a high stringency wash 
can comprise washing in prewarmed (68°C) solution containing 0.1XSSC/0.1%SDS 

10 for 1 5 minutes at 68°C. Furthermore, washes can be performed repeatedly or ! 

sequentially to obtain a desired result as known in the art. Equivalent conditions can 
be determined by varying one or more of the parameters given as an example, as 
known in the art, while maintaining a similar degree of identity or similarity between 
the target nucleic acid molecule and the primer or probe used. 

15 To determine the percent homology or identity of two nucleic acid 

sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in the sequence of one polypeptide or nucleic acid molecule for 
optimal alignment with the other polypeptide or nucleic acid molecule). The amino 
acid residues or nucleotides at corresponding amino acid positions or nucleotide 

20 positions are then compared, as described above. 

The present invention also provides isolated HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules that 
contain a fragment or portion that hybridizes under highly stringent conditions to a 
nucleotide sequence comprising a nucleotide sequence selected from SEQ ID NO: 1, 

25 SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and the complement 
of any of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID 
NO: 9 and also provides isolated nucleic acid molecules that contain a fragment or 
portion that hybridizes under highly stringent conditions to a nucleotide sequence 
encoding an amino acid sequence selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ 

30 ID NO: 6, SEQ ID NO: 8, and SEQ ID NO: 10. The nucleic acid fragments of the 
invention are at least about 15, preferably, at least about 18, 20, 23, or 25 
nucleotides, and can be 30, 40, 50, 100, 200 or more nucleotides in length. Longer 
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fragments, for example, 30 or more nucleotides in length, that encode antigenic 
polypeptides described herein are particularly useful, such as for the generation of 
antibodies as described above. 

In a related aspect, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), 
5 and HDRP(ANLS) nucleic acid fragments of the invention are used as probes or 
primers in assays such as those described herein. "Probes" or "primers" are 
oligonucleotides that hybridize in a base-specific manner to a complementary strand 
of nucleic acid molecules. Such probes and primers include polypeptide nucleic 
acids, as described in Nielsen et ah, Science, 254, 1497-1500 (1991). As also used 
1 0 herein, the term "primer" in particular refers to a single-stranded oligonucleotide that 
acts as a point of initiation of template-directed DNA synthesis using well-known 
methods (e.g., PCR, LCR) including, but not limited to those described herein. 

Typically, a probe or primer comprises a region of nucleotide sequence that 
hybridizes to at least about 15, typically about 20-25, and more typically about 40, 
15 50 or 75, consecutive nucleotides of a nucleic acid molecule comprising a 

contiguous nucleotide sequence selected from: SEQ ID NO: 1, SEQ ID NO: 3, SEQ 
ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, the complement of any of SEQ ID NO: 1, 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and a sequence 
encoding an amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, 
20 SEQ ID NO: 8, or SEQ ID NO: 10. 

In preferred embodiments, a probe or primer comprises 100 or fewer 
nucleotides, preferably, from 6 to 50 nucleotides, and more preferably, from 12 to 30 
nucleotides. In other embodiments, the probe or primer is at least 70% identical to 
the contiguous nucleotide sequence or to the complement of the contiguous 
25 nucleotide sequence, preferably, at least 80% identical, more preferably, at least 90% 
identical, even more preferably, at least 95% identical, or even capable of selectively 
hybridizing to the contiguous nucleotide sequence or to the complement of the 
contiguous nucleotide sequence. Often, the probe or primer further comprises a 
label, e.g., radioisotope, fluorescent compound, enzyme, or enzyme co-factor. 
30 The nucleic acid molecules of the invention such as those described above 

can be identified and isolated using standard molecular biology techniques and the 
sequence information provided in SEQ ID NO: 1, SEQ ID NO; 3, SEQ ID NO: 5, 
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SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, 
SEQ ID NO: 8, and /or SEQ ID NO: 10. For example, nucleic acid molecules can 
be amplified and isolated by the polymerase chain reaction using synthetic 
oligonucleotide primers designed based on one or more of the nucleic acid 
5 sequences provided above and/or the complement of those sequences. Or such 
nucleic acid molecules may be designed based on nucleotide sequences encoding 
one or more of the amino acid sequences provided in SEQ ID NO: 2, SEQ ID NO: 4, 
SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10. See generally PCR Technology: 
Principles and Applications for DNA Amplification (ed. H. A. Erlich, Freeman Press, 

10 NY, NY, (1992); PCR Protocols: A Guide to Methods and Applications (Eds. Innis 
et al. 9 Academic Press, San Diego, CA, (1990); Mattila et al. 9 Nucleic Acids Res., 
19: 4967 (1991); Eckert et al, PCR Methods and Applications, 1 : 17 (1991); PCR 
(eds. McPherson et al, IRL Press, Oxford)); and U.S. Patent No. 4,683,202. The 
nucleic acid molecules can be amplified using cDNA, mRNA, or genomic DNA as a 

15 template, cloned into an appropriate vector and characterized by DNA sequence 
analysis. 

Other suitable amplification methods include the ligase chain reaction (LCR) 
(See Wu and Wallace, Genomics, 4:560 (1989), Landegren et al 9 Science, 241 : 1077 
(1988)), transcription amplification (Kwoh et al 9 Proc. Natl. Acad. Sci. USA, 

20 86:1 173 (1989)), and self-sustained sequence replication (See Guatelli et ah, Proc. 
Nat. Acad. Sci. USA, 87:1874 (1990)) and nucleic acid based sequence 
amplification (NASBA). The latter two amplification methods involve isothermal 
reactions based on isothermal transcription, that produce both single stranded RNA 
(ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio 

25 of about 30 or 100 to 1, respectively. 

The amplified DNA can be radiolabeled and used as a probe for screening a 
cDNA library derived from human cells, mRNA in zap express, ZIPLOX, or other 
suitable vector. Corresponding clones can be isolated, DNA can be obtained 
following in vivo excision, and the cloned insert can be sequenced in either or both 

30 orientations by art-recognized methods to identify the correct reading frame 

encoding a polypeptide of the appropriate molecular weight. For example, the direct 
analysis of the nucleotide sequence of nucleic acid molecules of the present 
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invention can be accomplished using well-known methods that are commercially 
available. See, for example, Sambrook et al, Molecular Cloning, A Laboratory 
Manual (2nd Ed., CSHP, New York (1989)); Zyskind et al, Recombinant DNA 
Laboratory Manual, (Acad. Press, (1988)). Using these or similar methods, the 
5 polypeptide and the DNA encoding the polypeptide can be isolated, sequenced, and 
further characterized. 

Antisense nucleic acid molecules of the invention can be designed using the 
nucleotide sequences ofSEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9 and/or the complement of any of SEQ ID NO: 1, SEQ ID NO: 
10 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 and/or a portion of those 

sequences, and/or the complement of those portion or sequences, and/or a sequence 
encoding the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, 
SEQ ID NO: 8, SEQ ID NO: 10, or encoding a portion of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ K> NO: 10. Such antisense nucleic 
15 acid molecules can be constructed using chemical synthesis and enzymatic ligation 
reactions using procedures known in the art. For example, an antisense nucleic acid 
molecule (e.g., an antisense oligonucleotide) can be chemically synthesized using 
naturally occurring nucleotides or variously modified nucleotides designed to 
increase the biological stability of the molecules or to increase the physical stability 
20 of the duplex formed between the antisense and sense nucleic acids, e.g., 

phosphorothioate derivatives and acridine substituted nucleotides can be used. 
Alternatively, the antisense nucleic acid molecule can be produced biologically using 
an expression vector into which a nucleic acid molecule has been subcloned in an 
antisense orientation (i.e., RNA transcribed from the inserted nucleic acid molecule 
25 will be of an antisense orientation to a target nucleic acid of interest). 

In general, the isolated HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), 
and HDRP(ANLS) nucleic acid sequences of the invention can be used as molecular 
weight markers on Southern blots, and as chromosome markers that are labeled to 
map related gene positions. The nucleic acid sequences can also be used to compare 
30 with endogenous DNA sequences in patients to identify genetic disorders (e.g., a 
predisposition for or susceptibility to a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease), and as probes, such as to hybridize and 
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discover related DNA sequences or to subtract out known sequences from a sample. 
The nucleic acid molecules of the present invention can also be used as therapeutic 
agents. 

By a "cell proliferation disease" is meant a disease that is caused by or results 
5 in undesirably high levels of cell division, undesirably low levels of apoptosis, or 
both. For example, cancers such as lymphoma, leukemia, melanoma, ovarian 
cancer, breast cancer, pancreatic cancer, prostate cancer, colon cancer, and lung 
cancer are all examples of cell proliferation diseases. Myeloproliferative disorders, 
including polycythemia vera, essential thrombocythemia, agnogenic myeloid 

10 metaplasia, and chronic myelogenous leukemia are also cell proliferation diseases. 

By a "cell differentiation disease" is meant a disease that is caused by or 
results in undesirably low levels of cell differentiation, or by undesirably high levels 
of cell differentiation. For example, cancers such as lymphoma, leukemia, 
melanoma, ovarian cancer, breast cancer, pancreatic cancer, prostate cancer, colon 

1 5 cancer, and lung cancer are all examples of cell differentiation diseases. 
Myeloproliferative disorders, including polycythemia vera, essential 
thrombocythemia, agnogenic myeloid metaplasia, and chronic myelogenous 
leukemia are also cell differentiation diseases. 

By an "apoptotic disease" is meant a condition in which the apoptotic 

20 response is abnormal. This may pertain to a cell or a population of cells that does 
not undergo cell death under appropriate conditions. For example, normally a cell 
will die upon exposure to apoptotic-triggering agents, such as chemotherapeutic 
agents, or ionizing radiation. When, however, a subject has an apoptotic disease, for 
example, cancer, the cell or a population of cells may not undergo cell death in 

25 response to contact with apoptotic-triggering agents. In addition, a subject may have 
an apoptotic disease when the occurrence of cell death is too low, for example, when 
the number of proliferating cells exceeds the number of cells undergoing cell death, 
as occurs in cancer when such cells do not properly differentiate. 

An apoptotic disease may also be a condition characterized by the occurrence 

30 of undesirably high levels of apoptosis. For example, certain neurodegenerative 
diseases, including but not limited to Alzheimer's disease, Parkinson's disease, 
amyotrophic lateral sclerosis, multiple sclerosis, restenosis, stroke, and ischemic 
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brain injury are apoptotic diseases in which neuronal cells undergo undesired cell 
death. 

Other diseases for which the polypeptides and nucleic acid molecules of the 
present invention may be useful for diagnosing and/or treating include, but are not 
5 limited to Huntington's disease. 

The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and 
HDRP(ANLS) nucleic acid molecules of the present invention can further be used to 
derive primers for genetic fingerprinting, to raise anti-polypeptide antibodies using 
DNA immunization techniques, and as an antigen to raise anti-DNA antibodies or 
10 elicit immune responses. Portions or fragments of the nucleotide sequences 

identified herein (and the corresponding complete gene sequences) can be used in 
numerous ways as polynucleotide reagents. For example, these sequences can be 
used to: (i) map their respective genes on a chromosome; and, thus, locate gene 
regions associated with genetic disease; (ii) identify an individual from a minute 
15 biological sample (tissue typing); and (iii) aid in forensic identification of a 
biological sample. 

In addition, the HDAC9, HDAC9a, HDAC9(ANLS), EDA C9a(ANLS), and 
HDRP(ANLS) nucleotide sequences of the invention can be used to identify and 
express recombinant polypeptides for analysis, characterization, or therapeutic use, 

20 or as markers for tissues in which the corresponding polypeptide is expressed, either 
constitutively, during tissue differentiation, or in diseased states. The nucleic acid 
sequences can additionally be used as reagents in the screening and/or diagnostic 
assays described herein, and can also be included as components of kits (eg., 
reagent kits) for use in the screening and/or diagnostic assays described herein. 

25 Standard techniques, such as the polymerase chain reaction (PCR) and DNA 

hybridization, may be used to clone HDAC9, HDAC9a, HDAC9(£NLS), 
HDAC9a(ANLS), oiHDRP(ANLS) homologs in other species, for example, 
mammalian homologs. HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS) > or 
HDRP(ANLS) homologs may be readily identified using low-stringency DNA 

30 hybridization or low-stringency PCR with human HDAC9, HDAC9a, 

HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) probes or primers. Degenerate 
primers encoding human HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
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HDRP(ANLS) polypeptides may be used to clone HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) homologs by RT-PCR. 

Alternatively, additional HDAC9, HDAC9a, HDAC9(ANLS), 
HDA C9a( ANLS) , or HDRP(ANLS) homologs can be identified by utilizing 
5 consensus sequence information for HDAC9, HDAC9a, HDAC9(ANLS), 

HDAC9a(ANLS), or HDRP(ANLS) polypeptides to search for similar polypeptides 
in other species. For example, polypeptide databases for other species can be 
searched for proteins with the HDAC domains described herein. Candidate 
polypeptides containing such a motif can then be tested for their HDAC9, HDAC9a, 
10 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) biological activities, using 
methods described herein. 

EXPRESSION OF THE NUCLEIC ACID MOLECULES OF THE INVENTION 

Another aspect of the invention pertains to nucleic acid constructs containing 

15 an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic 
acid molecule, for example, one selected from the group consisting of SEQ ID NO: 
1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and the 
complement of any of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 
7, or SEQ ID NO: 9 (or portions thereof). Yet another aspect of the invention 

20 pertains to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) 
nucleic acid constructs containing a nucleic acid molecule encoding the amino acid 
sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ 
ID NO: 10. The constructs comprise a vector (e.g., an expression vector) into which 
a sequence of the invention has been inserted in a sense or antisense orientation. 

25 As used herein, the term "vector"' or "construct" refers to a nucleic acid 

molecule capable of transporting another nucleic acid to which it has been linked. 
One type of vector is a "plasmid," which refers to a circular double stranded DNA 
loop into which additional DNA segments can be ligated. Another type of vector is 
a viral vector, wherein additional DNA segments can be ligated into the viral 

30 genome. Certain vectors are capable of autonomous replication in a host cell into 
which they are introduced (e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
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mammalian vectors) are integrated into the genome of a host cell upon introduction 
into the host cell, and thereby are replicated along with the host genome. Moreover, 
certain vectors, expression vectors, are capable of directing the expression of genes 
to which they are operably linked, hi general, expression vectors of utility in 
5 recombinant DNA techniques are often in the form of plasmids. However, the 
invention is intended to include such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated 
viruses) that serve equivalent functions. 

Preferred recombinant expression vectors of the invention comprise a nucleic 
10 acid molecule of the invention in a form suitable for expression of the nucleic acid 
molecule in a host cell. This means that the recombinant expression vectors include 
one or more regulatory sequences, selected on the basis of the host cells to be used 
for expression, which is operably linked to the nucleic acid sequence to be 
expressed. Within a recombinant expression vector, "operably linked" is intended to 
15 mean that the nucleotide sequence of interest is linked to the regulatory sequence® 
in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro 
transcription/translation system or in a host cell when the vector is introduced into 
the host cell). The term "regulatory sequence" is intended to include promoters, 
enhancers and other expression control elements (e.g., polyadenylation signals). 
20 Such regulatory sequences are described, for example, in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990). 
Regulatory sequences include those that direct constitutive expression of a 
nucleotide sequence in many types of host cell and those that direct expression of the 
nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory 
25 sequences). 

It will be appreciated by those skilled in the art that the design of the 
expression vector can depend on such factors as the choice of the host cell to be 
transformed and the level of expression of polypeptide desired. The expression 
vectors of the invention can be introduced into host cells to thereby produce 
30 polypeptides, including fusion polypeptides, encoded by nucleic acid molecules as 
described herein. 
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The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide of the invention in prokaryotic or eukaryotic cells, e.g., 
bacterial cells, such as E. coli, insect cells (using baculovirus expression vectors), 
yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
5 supra. Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example, using T7 promoter regulatory sequences and T7 
polymerase. 

Another aspect of the invention pertains to host cells into which a 
recombinant expression vector of the invention has been introduced. The terms 

10 "host cell" and "recombinant host cell" are used interchangeably herein. It is 

understood that such terms refer not only to the particular subject cell but also to the 
progeny or potential progeny of such a cell. Because certain modifications may 
occur in succeeding generations due to. either mutation or environmental influences, 
such progeny may not, in fact, be identical to the parent cell, but are still included 

15 within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a nucleic 
acid molecule of the invention can be expressed in bacterial cells (e.g. t 'K coli), 
insect cells, yeast, or mammalian cells (such as Chinese hamster ovary cells (CHO) 
or COS cells, human 293T cells, HeLa cells, NTH 3T3 cells, and mouse 

20 erythroleukemia (MEL) cells). Other suitable host cells are known to those skilled 
in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of 

25 art-recognized techniques for introducing a foreign nucleic acid molecule (e.g., 
DNA) into a host cell, including calcium phosphate or calcium chloride 
co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. Suitable methods for transforming or transfecting host cells can be 
found in Sambrook, et ah (supra), and other laboratory manuals. 

30 For stable transfection of mammalian cells, it is known that, depending upon 

the expression vector and transfection technique used, only a small fraction of cells 
may integrate the foreign DNA into their genome. In order to identify and select 
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these integrants, a gene that encodes a selectable marker (e.g., for resistance to 
antibiotics) is generally introduced into the host cells along with the gene of interest. 
Preferred selectable markers include those that confer resistance to drugs, such as 
G418, hygromycin, or methotrexate. Nucleic acid molecules encoding a selectable 
5 marker can be introduced into a host cell on the same vector as the nucleic acid 
molecule of the invention or can be introduced on a separate vector. Cells stably 
transfected with the introduced nucleic acid molecule can be identified by drug 
selection (e.g., cells that have incorporated the selectable marker gene will survive, 
while the other cells die). 
1 0 A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 

culture, can be used to produce (i.e., express) a polypeptide of the invention. 
Accordingly, the invention further provides methods for producing a polypeptide 
using the host cells of the invention. In one embodiment, the method comprises 
culturing the host cell of invention (into which a recombinant expression vector 
1 5 encoding a polypeptide of the invention has been introduced) in a suitable medium 
such that the polypeptide is produced. In another embodiment, the method further 
comprises isolating the polypeptide from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman 
transgenic animals. For example, in one embodiment, a host cell of the invention is 
20 a fertilized oocyte or an embryonic stem cell into which an HDA C9, HDA C9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule of the 
invention has been introduced. Such host cells can then be used to create 
non-human transgenic animals in which exogenous nucleotide sequences have been 
introduced into the genome or homologous recombinant animals in which 
25 endogenous nucleotide sequences have been altered. Such animals are useful for 
studying the function and/or activity of the nucleotide sequence and polypeptide 
encoded by the sequence and for identifying and/or evaluating modulators of then- 
activity. 

As used herein, a "transgenic animal" is a non-human animal, preferably, a 
30 mammal, more preferably, a rodent such as a rat or mouse, in which one or more of 
the cells of the animal includes a transgene. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, chickens, and amphibians. A 
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transgene is exogenous DNA that is integrated into the genome of a cell from which 
a transgenic animal develops and that remains in the genome of the mature animal, 
thereby directing the expression of an encoded gene product in one or more cell 
types or tissues of the transgenic animal. As used herein, a "homologous 
5 recombinant animal" is a non-human animal, preferably, a mammal, more 

preferably, a mouse, in which an endogenous gene has been altered by homologous 
recombination between the endogenous gene and an exogenous DNA molecule 
introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to 
development of the animal. 

10 Methods for generating transgenic animals via embryo manipulation and 

microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, 
U.S. Patent No. 4,873,191, and in Hogan, Manipulating the Mouse Embryo (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (1986)). Methods for 

1 5 constructing homologous recombination vectors and homologous recombinant 
animals are described further in Bradley, Current Opinion in Bio/Technology, 
2:823-829 (1991) and in PCT Publication Nos. WO 90/1 1354, WO 91/01 140, WO 
92/0968, and WO 93/04169. Clones of the non-human transgenic animals described 
herein can also be produced according to the methods described in Wilmut et aL 9 

20 Nature, 385:810-813 (1997) and PCT Publication Nos. WO 97/07668 and WO 
97/07669. 

ANTIBODIES OF THE INVENTION 

Polyclonal and/or monoclonal antibodies that selectively bind one form of an 

25 HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 

polypeptide but not another form of the polypeptide are also provided. Antibodies 
are also provided that bind a portion of either the variant or reference HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide that 
contains the polymorphic site or sites. 

30 In another aspect, the invention provides antibodies to each of the HDAC9, 

HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) polypeptides and 
polypeptide fragments of the invention, having an amino acid sequence encoded 
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by SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, 
or a portion thereof, or having an amino acid sequence encoded by a nucleic acid 
molecule comprising all or a portion of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 
5, SEQ ID NO: 7, or SEQ ID NO: 9, (e.g, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
5 NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, or another variant, or portion thereof). 
The term "purified antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of immunoglobulin molecules, i.e. 9 
molecules that contain an antigen binding site that selectively binds an antigen. A 
molecule that selectively binds to a polypeptide of the invention is a molecule that 
10 binds to that polypeptide or a fragment thereof, but does not substantially bind other 
molecules in a sample, e.g., a biological sample that naturally contains the 
polypeptide. Preferably the antibody is at least 60%, by weight, free from proteins 
and naturally occurring organic molecules with which it naturally associated. More 
preferably, the antibody preparation is at least 75% or 90%, and most preferably, 
1 5 99%, by weight, antibody. Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab')2 fragments that can be 
generated by treating the antibody with an enzyme such as pepsin. 

The invention provides polyclonal and monoclonal antibodies that selectively 
bind to an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
20 polypeptide of the invention. The term "monoclonal antibody" or "monoclonal 
antibody composition," as used herein, refers to a population of antibody molecules 
that contain only one species of an antigen binding site capable of immunoreacting 
with a particular epitope of a polypeptide of the invention. A monoclonal antibody 
composition thus typically displays a single binding affinity for a particular 
25 polypeptide of the invention with which it immunoreacts. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a desired immunogen, e.g., an HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide of the invention or 
fragment thereof. The antibody titer in the immunized subject can be monitored 
30 over time by standard techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized polypeptide. If desired, the antibody molecules 
directed against the polypeptide can be isolated from the mammal (e.g, from the 
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blood) and further purified by well-known techniques, such as protein A 
chromatography to obtain the IgG fraction. 

At an appropriate time after immunization, e.g., when the antibody titers are 
highest, antibody-producing cells can be obtained from the subject and used to 
5 prepare monoclonal antibodies by standard techniques, such as the hybridoma 

technique originally described by Kohler and Milstein, Nature, 256:495-497 (1975), 
the human B cell hybridoma technique (Kozbor et aL, Immunol. Today, 4:72 
(1983)), the EBV-hybridoma technique (Cole et aL, Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)) or trioma techniques. The 

1 0 technology for producing hybridomas is well known (see generally Current Protocols 
in Immunology, Coligan et aL, (eds.) John Wiley & Sons, Inc., New York, NY 
(1994)). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with an immunogen as described 
above, and the culture supernatants of the resulting hybridoma cells are screened to 

1 5 identify a hybridoma producing a monoclonal antibody that binds a polypeptide of 
the invention. 

Any of the many well known protocols used for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating a monoclonal 
antibody to a polypeptide of the invention (see, e.g., Current Protocols in 

20 Immunology, supra; Galfre et aL, (1977) Nature, 266:55052; R.H. Kenneth, in 
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum 
Publishing Corp., New York, New York (1980); and Lerner, Yale J. Biol. Med., 
54:387-402 (1981)). Moreover, the ordinarily skilled worker will appreciate that 
there are many variations of such methods that also would be useful. 

25 Alternative to preparing monoclonal antibody-secreting hybridomas, a 

monoclonal antibody to an HDAC9, HDAC9a, HD AC9(ANLS), HDAC9a(ANLS), 
or HDRP(ANLS) polypeptide of the invention can be identified and isolated by 
screening a recombinant combinatorial immunoglobulin library (e.g, an antibody 
phage display library) with the polypeptide to thereby isolate immunoglobulin 

30 library members that bind the polypeptide. Kits for generating and screening phage 
display libraries are commercially available (e.g, the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage 
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Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening antibody display library 
can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO 92/20791; 
5 PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT 
Publication No. WO 90/02809; Fuchs et al, Bio/Technology, 9:1370-1372 (1991); 
Hayetal., Hum. Antibod. Hybridomas, 3:81-85 (1992); HuseetaL, Science, 
246:1275-1281 (1989); and Griffiths etal, EMBO J., 12:725-734 (1993). 
10 Additionally, recombinant antibodies, such as chimeric and humanized 

monoclonal antibodies, comprising both human and non-human portions, which can 
be made using standard recombinant DNA techniques, are within the scope of the 
invention. Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art. 
15 In general, antibodies of the invention (e.g., a monoclonal antibody) can be 

used to isolate an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide of the invention by standard techniques, such as affinity 
chromatography or immunoprecipitation. A polypeptide-specific antibody can 
facilitate the purification of natural polypeptide from cells and of recombinantly 
20 produced polypeptide expressed in host cells. Moreover, an antibody specific for an 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide of the invention can be used to detect the polypeptide (e.g., in a cellular 
lysate, cell supernatant, or tissue sample) in order to evaluate the abundance and 
pattern of expression of the polypeptide. 
25 The antibodies of the present invention can also be used diagnostically to 

monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for 
example, determine the efficacy of a given treatment regimen. Detection can be 
facilitated by coupling the antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic groups, fluorescent 
30 materials, luminescent materials, bioluminescent materials, and radioactive 

materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 
phosphatase, p-galactosidase, and acetylcholinesterase; examples of suitable 
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prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples 
of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride and 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
5 bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include 125 1, 131 1, 35 S, and 3 H. 

DIAGNOSTIC AND SCREENING ASSAYS OF THE INVENTION 

The present invention also pertains to diagnostic assays for assessing HDAC 

10 9 HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) gene expression, or 
for assessing activity of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptides of the invention. In one embodiment, the assays are 
used in the context of a biological sample (e.g., blood, serum, cells, tissue) to 
thereby determine whether an individual is afflicted with a cell proliferation disease, 

15 an apoptotic disease, or a cell differentiation disease, or is at risk for (has a 

predisposition for or a susceptibility to) developing a cell proliferation disease, an 
apoptotic disease, or a cell differentiation disease. The invention also provides for 
prognostic (or predictive) assays for determining whether an individual is 
susceptible to developing a cell proliferation disease, an apoptotic disease, or a cell 

20 differentiation disease. For example, mutations in the HDAC9, HDAC9a, 

HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) nucleic acid molecule can be 
assayed in a biological sample. Such assays can be used for prognostic or predictive 
purpose to thereby prophylactically treat an individual prior to the onset of 
symptoms associated with a cell proliferation disease, an apoptotic disease, or a cell 

25 differentiation disease. 

Another aspect of the invention pertains to assays for monitoring the 
influence of agents, or candidate compounds (e.g., drugs or other agents) on the 
nucleic acid molecule expression or biological activity of polypeptides of the 
invention, as well as to assays for identifying candidate compounds that bind to an 

30 HDAC9, HDAC9a polypeptide, an HDAC9(ANLS) polypeptide, an 

HDAC9a(ANLS) polypeptide, or an HDRP(ANLS) polypeptide. These and other 
assays and agents are described in further detail in the following sections. 
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DIAGNOSTIC ASSAYS 

HDAC9, HDAC9a, HDAC9( ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
nucleic acid molecules, probes, primers, polypeptides, and antibodies to an HDAC9, 
5 an HDAC9a protein, an HDAC9(ANLS) protein, an HDAC9a(ANLS) protein, or an 
HDRP(ANLS) protein can be used in methods of diagnosis of a susceptibility to, or 
likelihood of having a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease, as well as in kits useful for diagnosis of a susceptibility to a 
cell proliferation disease, an apoptotic disease, or a cell differentiation disease. 
10 In one embodiment of the invention, diagnosis of a decreased susceptibility 

to a cell proliferation disease, an apoptotic disease, or a cell differentiation disease is 
made by detecting a polymorphism in HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS). The polymorphism can be a mutation in 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), such as the 
15 insertion or deletion of a single nucleotide, or of more than one nucleotide, resulting 
in a frame shift mutation; the change of at least one nucleotide, resulting in a change 
in the encoded amino acid; the change of at least one nucleotide, resulting in the 
generation of a premature stop codon; the deletion of several nucleotides, resulting 
in a deletion of one or more amino acids encoded by the nucleotides; the insertion of 
20 one or several nucleotides, such as by unequal recombination or gene conversion, 
resulting in an interruption of the coding sequence of the gene; duplication of all or a 
part of the gene; transposition of all or a part of the gene; or rearrangement of all or a 
part of the gene, or a change in the expression pattern of the various HDAC9 
isoforms. More than one such mutation may be present in a single nucleic acid 
25 molecule. 

Such sequence changes cause a mutation in the polypeptide encoded by 
HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS). For 
example, if the mutation is a frame shift mutation, the frame shift can result in a 
change in the encoded amino acids, and/or can result in the generation of a 
30 premature stop codon, causing generation of a truncated polypeptide. Alternatively, 
a polymorphism associated with a decreased susceptibility to a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease can be a synonymous 
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mutation in one or more nucleotides (z.e., a mutation that does not result in a change 
in the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide). Such a polymorphism may alter sites, affect the stability or transport 
of mRNA, or otherwise affect the transcription or translation of the nucleic acid 
5 molecule. HDAC9 9 HDAC9a 9 HDAC9(ANLS), HDAC9a(ANLS) 9 or HDRP(ANLS) 
that has any of the mutations described above is referred to herein as a "mutant 
nucleic acid molecule." 

In a first method of diagnosing a decreased susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease, 

10 hybridization methods, such as Southern analysis, Northern analysis, or in situ 
hybridizations, can be used (see Ausubel, et al 9 supra). For example, a biological 
sample from a test subject (a 'test sample") of genomic DNA, RNA, or cDNA, is 
obtained from an individual suspected of having, being susceptible to or predisposed 
for, or carrying a defect for, a cell proliferation disease, an apoptotic disease, or a 

15 cell differentiation disease (the 'test individual"). The individual can be an adult, 
child, or fetus. The test sample can be from any source that contains genomic DNA, 
such as a blood sample, sample of amniotic fluid, sample of cerebrospinal fluid, or 
tissue sample from skin, muscle, buccal or conjunctival mucosa, placenta, 
gastrointestinal tract, or other organs. A test sample of DNA from fetal cells or 

20 tissue can be obtained by appropriate methods, such as by amniocentesis or 

chorionic villus sampling. The DNA, RNA, or cDNA sample is then examined to 
determine whether a polymorphism in HDAC9 9 HDAC9a 9 HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) is present, and/or to determine which variant(s) 
encoded by HDAC9, HDAC9a 9 HDAC9(ANLS) 9 HDAC9a(ANLS) 9 or HDRP(ANLS) 

25 is present. The presence of the polymorphism or variant(s) can be indicated by 
hybridization of the gene in the genomic DNA, RNA, or cDNA to a nucleic acid 
probe. A "nucleic acid probe," as used herein, can be a DNA probe or an RNA 
probe; the nucleic acid probe can contain at least one polymorphism in HDAC9, 
HDAC9a 9 HDAC9(ANLS) 9 HDAC9a(ANLS) 9 or HDRP(AMLS) or contains a nucleic 

30 acid encoding a particular variant of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS). The probe can be any of the nucleic acid 
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molecules described above (e.g., the entire nucleic acid molecule, a fragment, a 
vector comprising the gene, a probe, or primer, etc.). 

To diagnose a decreased susceptibility to a cell proliferation disease, an 
apoptotic disease, or a cell differentiation disease, a hybridization sample is formed 
5 by contacting the test sample containing HDAC9 , HDAC9a, HDAC9(ANLS), 

HDA C9a(ANLS), or HDRP(ANLS), with at least one nucleic acid probe. A preferred 
probe for detecting mRNA or genomic DNA is a labeled nucleic acid probe capable 
of hybridizing to HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or 
HDRP(ANLS) mRNA or genomic DNA sequences described herein. The nucleic 
10 acid probe can be, for example, a mil-length nucleic acid molecule, or a portion 
thereof such as an oligonucleotide of at least 15, 30, 50, 100, 250, or 500 
nucleotides in length and sufficient to specifically hybridize under stringent 
conditions to appropriate mRNA or genomic DNA. For example, the nucleic acid 
probe can be all or a portion of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ 
15 ID NO: 7, SEQ ID NO: 9, or the complement of SEQ ID NO: 1 or SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9; or can be a nucleic acid molecule' 
encoding all or a portion of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
NO: 8, or SEQ ID NO: 10. Other suitable probes for use in the diagnostic assays of 
the invention are described above (see. e.g., probes and primers discussed under the 
20 heading, "Nucleic Acids of the Invention"). 

The hybridization sample is maintained under conditions that are sufficient to 
allow specific hybridization of the nucleic acid probe to HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), otHDRP(*NLS). "Specific hybridization," as 
used herein, indicates exact hybridization (e.g., with no mismatches). Specific 
25 hybridization can be performed under high stringency conditions or moderate 
stringency conditions, for example, as described above. In a particularly preferred 
embodiment, the hybridization conditions for specific hybridization are high 
stringency. 

Specific hybridization, if present, is then detected using standard methods. If 
30 specific hybridization occurs between the nucleic acid probe and HDAC9, HDAC9a, 
HDAC9WLS), HDAC9a(ANLS), or HDRP(ANLS) in the test sample, then HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) has the 
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polymorphism, or is the variant, that is present in the nucleic acid probe. More than 
one nucleic acid probe can also be used concurrently in this method. Specific 
hybridization of any one of the nucleic acid probes is indicative of a polymorphism 
in HDAC9, HDAC9a, HDAC9(ANLS) 9 HDAC9a(ANLS), or HDRP(ANLS), or of the 
5 presence of a particular variant encoded by HDAC9, HDAC9a, HDAC9(ANLS) > 
HDAC9a(ANLS), or HDRP(ANLS), and is therefore diagnostic for a decreased 
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. 

In Northern analysis (see Current Protocols in Molecular Biology, Ausubel, 

10 et al , supra), the hybridization methods described above are used to identify the 
presence of a polymorphism or of a particular variant, associated with a decreased 
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. For Northern analysis, a test sample of RNA is obtained 
from the individual by appropriate means. Specific hybridization of a nucleic acid 

15 probe, as described above, to RNA from the individual is indicative of a 
polymorphism in HDAC9, HDAC9a i HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS), or of the presence of a particular variant encoded by HDAC9, 
HDAC9a, HDAC9(ANLS),HDAC9a(ANLS) 9 or HDRPfANLS), and is therefore 
diagnostic for a decreased susceptibility to a cell proliferation disease, an apoptotic 

20 disease, or a cell differentiation disease. 

For representative examples of use of nucleic acid probes, see, for example, 
U.S. Patent Nos. 5,288,611 and 4,851,330. 

Alternatively, a peptide nucleic acid (PNA) probe can be used instead of a 
nucleic acid probe in the hybridization methods described above. PNA is a DNA 

25 mimic having a peptide-like, inorganic backbone, such as N-(2-aminoethyl)glycine 
units, with an organic base (A, G, C, T, or U) attached to the glycine nitrogen via a 
methylene carbonyl linker (see, for example, Nielsen et ai, Bioconjugate Chemistry, 
5 (1994), American Chemical Society, p. 1 (1994)). The PNA probe can be 
designed to specifically hybridize to a gene having a polymorphism associated with 

30 a susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. Hybridization of the PNA probe to HDAC9, HDAC9a 9 
HDAC9(ANLS), HDA C9a(ANLS) 9 or HDRP(ANLS) is diagnostic for a decreased 
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susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. 

In another method of the invention, mutation analysis by restriction digestion 
can be used to detect a mutant nucleic acid molecule, or nucleic acid molecules 
5 containing a polymorphism(s), if the mutation or polymorphism in the gene results 
in the creation or elimination of a restriction site. A test sample containing genomic 
DNA is obtained from the individual. Polymerase chain reaction (PGR) can be used 
to amplify HDAC9, HDACPa, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
(and, if necessary, the flanking sequences) in the test sample of genomic DNA from 
10 the test individual. RFLP analysis is conducted as described (see Current Protocols 
in Molecular Biology, supra). The digestion pattern of the relevant DNA fragment 
indicates the presence or absence of the mutation or polymorphism mHDAC9, 
HDAC9a,HDAC9(ANLS),HDAC9a(ANLS), or ^r^, and therefore ' 
indicates the presence or absence of this decreased susceptibility to a cell 
15 proliferation disease, an apoptotic disease, or a cell differentiation disease. 

Sequence analysis can also be used to detect specific polymorphisms in 
HDAC9, HDAC9a, HDAC9(ANLS) > HDAC9a(ANLS), or HDRP(ANLS). A test 
sample of DNA or RNA is obtained from the test individual. PCR or other 
appropriate methods can be used to amplify the nucleic acid molecule, and/or its 
flanking sequences, if desired. The sequence of HDAC9, HDAC9a, HDAC9(ANLS) 
HDA C9a(ANLS), or HDRPfANLS), or HDRP(ANLS), or a fragment of the any of ' 
those nucleic acid molecules, or an HDAC9, HDAC9a, HDAC9(ANLS), 
HDA C9a(ANLS), or HDRP(ANLS) cDNA, or a fragment of any of those cDNAs, or 
anHDACP, HDACPa, HDAC9(ANLS), HDAC9a( ANLS), oxHDRP(ANLS) mRNA, 
25 or a fragment of any of those mRNAs, is determined, using standard methods. The 
sequence of the above gene, gene fragment, cDNA cDNA fragment, mRNA or 
mRNA fragment is compared with the known nucleic acid sequence of the nucleic 
acid molecule, cDNA (e.g., SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, or a nucleic acid sequence encoding the protein of SEQ ID 
NO: 2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, or a fragment 
thereof) or mRNA, as appropriate. The presence of a polymorphism in HDAC9, 
HDAC9a, HDA C9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) indicates that the 



20 



30 
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individual has a decreased susceptibility to a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease. 

Allele-specific oligonucleotides can also be used to detect the presence of a 
polymorphism in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
5 HDRP(ANLS), through the use of dot-blot hybridization of amplified 

oligonucleotides with allele-specific oligonucleotide (ASO) probes (see, for 
example, Saiki et ah, Nature (London) 324:163-166 (1986)). An "allele-specific 
oligonucleotide" (also referred to herein as an "allele-specific oligonucleotide 
probe") is an oligonucleotide of approximately 10-50 base pairs, preferably 

10 approximately 15-30 base pairs, that specifically hybridizes to HDAC9, HDAC9a 9 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and that contains a 
polymorphism associated with a decreased susceptibility to a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease. An allele-specific 
oligonucleotide probe that is specific for particular polymorphisms in HDAC9, 

15 HDAC9a, HDAC9(ANLS) 9 HDA C9a (ANLS) , or HDRP(ANLS) can be prepared, 
using standard methods (see Current Protocols in Molecular Biology, supra). 

To identify polymorphisms in the gene that are associated with a decreased 
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease a test sample of DNA is obtained from the individual. PCR 

20 can be used to amplify all or a fragment of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS), and its flanking sequences. The DNA 
containing the amplified HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS) 9 or 
HDRP(ANLS) (or a fragment of any of those genes) is dot-blotted, using standard 
methods (see Current Protocols in Molecular Biology, supra), and the blot is 

25 contacted with the oligonucleotide probe. The presence of specific hybridization of 
the probe to the amplified HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) is then detected. Specific hybridization of an allele-specific 
oligonucleotide probe to DNA from the individual is indicative of a polymorphism 
in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and is 

30 therefore indicative of a decreased susceptibility to a cell proliferation disease, an 
apoptotic disease, or a cell differentiation disease. 
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In another embodiment, arrays of oligonucleotide probes that are 
complementary to target nucleic acid sequence segments from an individual, can be 
used to identify polymorphisms in HDAC9, HDAC9a, HDAC9(ANLS), 
HDACPa(ANLS), or HDRP(ANLS). For example, in one embodiment, an 
5 oligonucleotide array can be used. Oligonucleotide arrays typically comprise a 
plurality of different oligonucleotide probes that are coupled to a surface of a 
substrate in different known locations. These oligonucleotide arrays, also described 
as "GENECHIPS™,» have been generally described in the art, for example, U.S. 
Patent No. 5,143,854 and PCT patent publication Nos. WO 90/15070 and 92/10092. 
1 0 These arrays can generally be produced using mechanical synthesis methods or light 
directed synthesis methods that incorporate a combination of photolithographic 
methods and solid phase oligonucleotide synthesis methods. See Fodor et al., 
Science, 251:161-111 (1991), Pirrung et al, U.S. Patent No. 5,143,854; PCT 
Publication No. WO 90/15070; Fodor et al, PCT Publication No. WO 92/1 0092, 
15 and U.S. Patent No. 5,424,186, the entire teachings of each of which are 

incorporated by reference herein. Techniques for the synthesis of these arrays using 
mechanical synthesis methods are described in, e.g., U.S. Patent No. 5,384,261, the 
entire teachings of which are incorporated by reference herein. 

Once an oligonucleotide array is prepared, a nucleic acid of interest is 
20 hybridizedtothearrayandscannedforpolymorphisms. Hybridization and scanning 
are generally carried out by methods described herein and also in, e.g., Published 
PCT Application Nos. WO 92/10092 and WO 95/1 1995, and U.S. Patent No. 
5,424,186, the entire teachings of which are incorporated by reference herein. In 
brief, a target nucleic acid sequence that includes one or more previously identified 
25 polymorphic markers is amplified by well known amplification techniques, e.g., 
PCR. Typically, this involves the use of primer sequences that are complementary 
to the two strands of the target sequence both upstream and downstream from the 
polymorphism. Asymmetric PCR techniques may also be used. Amplified target, 
generally incorporating a label, is then hybridized with the array under appropriate 
30 conditions. Upon completion of hybridization and washing of the array, the array is 
scanned to determine the position on the array to which the target sequence 
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hybridizes. The hybridization data obtained from the scan is typically in the form of 
fluorescence intensities as a function of location on the array. 

Although primarily described in terms of a single detection block, e.g., for 
detection of a single polymorphism, arrays can include multiple detection blocks, 
5 and thus be capable of analyzing multiple, specific polymorphisms. In alternate 
arrangements, it will generally be understood that detection blocks may be grouped 
within a single array or in multiple, separate arrays so that varying, optimal 
conditions may be used during the hybridization of the target to the array. For 
example, it may often be desirable to provide for the detection of those 

10 polymorphisms that fall within G-C rich stretches of a genomic sequence, separately 
from those falling in A-T rich segments. This allows for the separate optimization 
of hybridization conditions for each situation. 

Additional descriptions of the use of oligonucleotide arrays for detection of 
polymorphisms can be found, for example, in U.S. Patent Nos. 5,858,659 and 

1 5 5,837,832, the entire teachings of which are incorporated by reference herein. 

Other methods of nucleic acid analysis can be used to detect polymorphisms 
in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or 
variants encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a ( ANLS) , or 
HDRP(ANLS). Representative methods include direct manual sequencing (Church 

20 and Gilbert Proc. Natl. Acad. Sci. USA 81 : 1991-1995, (1988); Sanger et aL, Proc. 
Natl. Acad. Sci. 74: 5463-5467 (1977); Beavis et aL, U.S. Patent No. 5,288,644); 
automated fluorescent sequencing; single-stranded conformation polymorphism 
assays (SSCP); clamped denaturing gel electrophoresis (CDGE); denaturing gradient 
gel electrophoresis (DGGE) (Sheffield et aL, Proc. Natl. Acad. Sci. USA 86: 

25 232-236 (1991)), mobility shift analysis (Orita et aL, Proc. Natl. Acad. Sci. USA 86: 
2766-2770 (1989)), restriction enzyme analysis (Flavell et aL, Cell 15: 25 (1978); 
Geever, et aL, Proc. Natl. Acad. Sci. USA 78: 5081 (1981)); heteroduplex analysis; 
chemical mismatch cleavage (CMC) (Cotton et aL, Proc. Natl. Acad. Sci. USA 85: 
4397-4401 (1985)); RNase protection assays (Myers et aL, Science 230: 1242 

30 (1985)); use of polypeptides that recognize nucleotide mismatches, such as E. coli 
mutS protein; and allele-specific PCR. 
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In another embodiment of the invention, diagnosis of a susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease can also 
be made by examining the level of an HDA C9, HDAC9a, HDA C9(ANLS), 
HDA C9a(ANLS), or HDRP(ANLS) nucleic acid, for example, using in situ 
5 hybridization techniques known to one skilled in the art, or by examining the level of 
expression, activity, and/or composition of an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide, by a variety of methods, including 
enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations, immunohistochemistry, and immunofluorescence. A test 
10 sample from an individual is assessed for the presence of an alteration in the level of 
an HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a( ANLS), or HDRP(ANLS) nucleic 
acid or in the expression and/or an alteration in composition of the polypeptide 
encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS), 
or for the presence of a particular variant encoded by HDAC9, HDAC9a, 
15 HDAC9(ANLS), HDAC9a(ANLS), or HDRP( ANLS). An alteration in expression of a 
polypeptide encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or 
HDRP(ANLS) can be, for example, an alteration in the quantitative polypeptide 
expression {i.e., the amount of polypeptide produced); an alteration in the 
composition of a polypeptide encoded by HDAC9, HDAC9a, HDA C9(ANLS), 
20 HDA C9a(ANLS), or HDRP(ANLS), or an alteration in the qualitative polypeptide 
expression {e.g., expression of a mutant HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide or variant thereof). In a preferred 
embodiment, diagnosis of a susceptibility to a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease is made by detecting a particular variant 
25 encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), 
or a particular pattern of variants. Preferably, increased levels of HDAC9, HDAC9a, 
HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) or increased expression or 
activity of anHDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide, relative to a control sample, for example, a sample 
30 known not to be associated with a cell proliferation disease, an apoptotic disease, or 
a cell differentiation disease, indicates an increased susceptibility or likelihood that 
the individual has a cell proliferation disease, an apoptotic disease, or a cell 
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differentiation disease. Alternatively, decreased levels of HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or decreased expression or 
activity of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide, relative to a control sample, for example, a sample 
5 known not to be associated with a cell proliferation disease, an apoptotic disease, or 
a cell differentiation disease, indicates a decreased susceptibility or likelihood that 
the individual has a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. 

Both quantitative and qualitative alterations can also be present. An 

10 "alteration" or "modulation" in the polypeptide expression, activity, or composition, 
as used herein, refers to an alteration in expression or composition in a test sample, 
as compared with the expression or composition of HDAC9, HDAC9a, 
HD AC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a control 
sample. A control sample is a sample that corresponds to the test sample (e.g., is 

1 5 from the same type of cells), and is from an individual who is not affected by a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease. An 
alteration in the expression or composition of the polypeptide in the test sample, as 
compared with the control sample, is indicative of a decreased susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease. 

20 Similarly, the presence of one or more different variants in the test sample, or the 
presence of significantly different amounts of different variants in the test sample, as 
compared with the control sample, is indicative of a decreased susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease. 

It is understood that alterations or modulations in polypeptide expression or 

25 function can occur in varying degrees. For example, an alteration or modulation in 
expression can be an increase, for example, by at least 1.5-fold to 2-fold, at least 3- 
fold, or, at least 5-fold, relative to the control. Alternatively, the alteration or 
modulation in polypeptide expression can be a decrease, for example, by at least 
10%, at least 40%, 50%, or 75%, or by at least 90%, relative to the control. 

30 Various means of examining expression or composition of the HDAC9, 

HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide can be 
used, including spectroscopy, colorimetry, electrophoresis, isoelectric focusing, and 
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immunoassays (e.g., David etal, U.S. Patent No. 4,376,110) such as 
immunoblotting (see also Ausubel et al., supra; particularly chapter 10). For 
example, in one embodiment, an antibody capable of binding to the polypeptide 
(e.g., as described above), preferably an antibody with a detectable label, can be 
5 used. Antibodies can be polyclonal, or more preferably, monoclonal. An intact 
antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term 
"labeled," with regard to the antibody, is intended to encompass direct labeling of 
the antibody by coupling (i.e., physically linking) a detectable substance to the 
antibody, as well as indirect labeling of the antibody by reacting it with another 
10 reagent that is directly labeled. An example of indirect labeling is detection of a 
primary antibody using a fluorescently labeled secondary antibody. 

Western blotting analysis, using an antibody as described above that 
specifically binds to a mutant HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an antibody that specifically 
1 5 binds to a non-mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide, or an antibody that specifically binds to a particular 
variant encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS), can be used to identify the presence in a test sample of a particular 
variant of a polypeptide encoded by a polymorphic or mutant HDAC9, HDAC9a, 
20 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), or the absence in a test sample 
of a particular variant or of a polypeptide encoded by a non-polymorphic or 
non-mutant gene. The presence of a polypeptide encoded by a polymorphic or 
mutant gene, or the absence of a polypeptide encoded by a non-polymorphic or 
non-mutant gene, is diagnostic for a decreased susceptibility to a cell proliferation 
25 disease, an apoptotic disease, or a cell differentiation disease, as is the presence (or 
absence) of particular variants encoded by the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule. 

In one embodiment of this method, the level or amount of HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a test 
30 sample is compared with the level or amount of the HDAC9, HDAC9a, 

HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a control 
sample. A level or amount of the polypeptide in the test sample that is higher or 
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lower than the level or amount of the polypeptide in the control sample, such that the 
difference is statistically significant, is indicative of an alteration in the expression of 
the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide, and is diagnostic for a decreased susceptibility to a cell proliferation 
5 disease, an apoptotic disease, or a cell differentiation disease. 

Alternatively, the composition of the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a test sample is compared with 
the composition of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide in a control sample. A difference in the composition of 

10 the polypeptide in the test sample, as compared with the composition of the 
polypeptide in the control sample (e.g., the presence of different variants), is 
diagnostic for a decreased susceptibility to a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease. In another embodiment, both the level or 
amount and the composition of the polypeptide can be assessed in the test sample 

15 and in the control sample. A difference in the amount or level of the polypeptide in 
the test sample, compared to the control sample; a difference in composition in the 
test sample, compared to the control sample; or both a difference in the amount or 
level, and a difference in the composition, is indicative of a decreased susceptibility 
to a cell proliferation disease, an apoptotic disease, or a cell differentiation disease. 

20 Kits (e.g., reagent kits) useful in the methods of diagnosis comprise 

components useful in any of the methods described herein, including, for example, 
hybridization probes or primers as described herein (e.g., labeled probes or primers), 
reagents for detection of labeled molecules, restriction enzymes (e.g., for RFLP 
analysis), allele-specific oligonucleotides, antibodies that bind to a mutant or to 

25 non-mutant (native) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide, means for amplification of nucleic acids comprising 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), or means 
for analyzing the nucleic acid sequence of HDAC9, HDAC9a 9 HDAC9(ANLS), 
HDAC9a(ANLS) 9 or HDRP(ANLS), or for analyzing the amino acid sequence of an 

30 HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide, etc. 
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SCREENING ASSAYS AND AGENTS IDENTIFIED THEREBY 

The invention provides methods (also referred to herein as "screening 
assays") for identifying the presence of a nucleotide that hybridizes to a nucleic acid 
of the invention, as well as for identifying the presence of a polypeptide encoded by 
5 a nucleic acid of the invention. In one embodiment, the presence (or absence) of a 
nucleic acid molecule of interest (e.g., a nucleic acid that has significant homology 
with a nucleic acid of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS)) in a sample can be assessed by contacting the sample with a nucleic 
acid comprising a nucleic acid of the invention (e.g. 3 a nucleic acid having the 
10 sequence of SEQ ID NO: 1, SEQ ED NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ 
ID NO : 9, which may optionally comprise at least one polymorphism, or the' 
complement thereof, or a nucleic acid encoding an amino acid having the sequence 
of SEQ ID NO: 2, SEQ ID NO:4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 
10, or a fragment or variant of such nucleic acids), under stringent conditions as 
15 described above, and then assessing the sample for the presence (or absence) of 
hybridization. In a preferred embodiment, high stringency conditions are conditions 
appropriate for selective hybridization. In another embodiment, a sample containing 
the nucleic acid molecule of interest is contacted with a nucleic acid containing a 
contiguous nucleotide sequence (e.g. , a primer or a probe as described above) that is 
20 at least partially complementary to a part of the nucleic acid molecule of interest 
(e.g., an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
nucleic acid), and the contacted sample is assessed for the presence or absence of 
hybridization. In a preferred embodiment, the nucleic acid containing a contiguous 
nucleotide sequence is completely complementary to a part of the nucleic acid 

25 moleculeof HDAC9,HDAC9a,HDAC9(ANLS),HDAC9a(ANLS), or 
HDRP(ANLS). 

In any of the above embodiments, all or a portion of the nucleic acid of 
interest can be subjected to amplification prior to performing the hybridization. 

In another embodiment, the presence (or absence) of an HDAC9, HDAC9a, 
30 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, such as a 
polypeptide of the invention or a fragment or variant thereof, in a sample can be 
assessed by contacting the sample with an antibody that specifically binds to the 
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polypeptide of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) (e.g., an antibody such as those described above), and then assessing 
the sample for the presence (or absence) of binding of the antibody to the HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide. 
5 In another embodiment, the invention provides methods for identifying 

agents or compounds (e.g., fusion proteins, polypeptides, peptidomimetics, 
prodrugs, receptors, binding agents, antibodies, small molecules or other drugs, or 
ribozymes) that alter or modulate (e.g., increase or decrease) the activity of the 
polypeptides described herein, or that otherwise interact with the polypeptides 

10 herein. For example, such compounds can be compounds or agents that bind to 
polypeptides described herein (e.g., HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) substrates or agents); that have a stimulatory or 
inhibitory effect on, for example, activity of polypeptides of the invention; or that 
change (e.g., enhance or inhibit) the ability of the polypeptides of the invention to 

1 5 interact with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 

HDRP(ANLS) binding agents; or that alter post-translational processing of the 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide (e.g., agents that alter proteolytic processing to direct the polypeptide 
from where it is normally synthesized to another location in the cell, such as the cell 

20 surface; or agents that alter proteolytic processing such that more polypeptide is 

released from the cell, etc.). In one example, the binding agent is a cell proliferation 
disease binding agent, an apoptotic disease binding agent, or a cell differentiation 
disease binding agent. As used herein, by a "cell proliferation disease binding 
agent," an "apoptotic disease binding agent/' or a "cell differentiation disease 

25 binding agent" is meant an agent as described herein that binds to a polypeptide of 
the present invention and modulates a cell proliferation disease, an apoptotic disease, 
or a cell differentiation disease. The modulation can be an increase or a decrease in 
the severity or progression of the disease. In addition, a cell proliferation disease 
binding agent, an apoptotic disease binding agent, or a cell differentiation disease 

30 binding agent includes an agent that binds to a polypeptide that is upstream (earlier) 
or downstream (later) of the cell signaling events mediated by a polypeptide of the 
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10 



present invention, and thereby modulates the overall activity of the signaling 
pathway; in turn, the disease state is modulated. 

The candidate compound can cause an increase in the activity of the 
polypeptide. For example, the activity of the polypeptide can be increased by at least 
1.5-fold to 2-fold, at least 3-fold, or, at least 5-fold, relative to the control. 
Alternatively, the polypeptide activity can be a decrease, for example, by at least 
10%, at least 20%, 40%, 50%, or 75%, or by at least 90%, relative to the control. 

In one embodiment, the invention provides assays for screening candidate 
compounds or test agents to identify compounds that bind to or modulate the activity 
of polypeptides described herein (or biologically active portions) thereof), as well as 
agents identifiable by the assays. As used herein, a "candidate compound" or "test 
agent" is a chemical molecule, be it naturaUy-occurring or artificially-derived, and 
includes, for example, peptides, proteins, synthesized molecules, for example,' 
synthetic organic molecules, naturally-occurring molecule, for example, naturally 
15 occurring organic molecules, nucleic acid molecules, and components thereof. 

In general, candidate compounds for uses in the present invention may be 
identified from large libraries of natural products or synthetic (or semi-synthetic) 
extracts or chemical libraries according to methods known in the art. Those skilled 
in the field of drug discovery and development will understand that the precise 
source of test extracts or compounds is not critical to the screening procedure(s) of 
the invention. Accordingly, virtually any number of chemical extracts or compounds 
can be screened using the exemplary methods described herein. Examples of such 
extracts or compounds include, but are not limited to, plant-, fungal-, prokaryotic- or 
animal-based extracts, fermentation broths, and synthetic compounds, as well as 
25 modification of existing compounds. Numerous methods are also available for 
generating random or directed synthesis (e.g., semi-synthesis or total synthesis) of 
any number of chemical compounds, including, but not limited to, saccharide-, 
lipid-, peptide-, and nucleic acid-based compounds. Synthetic compound libraries 
• are commercially available, e.g., from Brandon Associates (Merrimack, NH) and 
30 Aldrich Chemical (Milwaukee, WI). Alternatively, libraries of natural compounds 
m the form of bacterial, fungal, plant, and animal extracts are commercially 
available from a number of sources, including Biotics (Sussex, UK), Xenova 



20 
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(Slough, UK), Harbor Branch Oceangraphics Institute (Ft. Pierce, FL), and 
PharmaMar, U.S.A. (Cambridge, MA). In addition, natural and synthetically 
produced libraries are generated, if desired, according to methods known in the art, 
e.g., by standard extraction and fractionation methods. For example, candidate 
5 compounds can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the "one-bead one-compound" library method; and 
synthetic library methods using affinity chromatography selection. The biological 

1 0 library approach is limited to polypeptide libraries, while the other four approaches 
are applicable to polypeptide, non-peptide oligomer or small molecule libraries of 
compounds (Lam, Anticancer Drug Des., 12: 145 (1997)). Furthermore, if desired, 
any library or compound is readily modified using standard chemical, physical, or 
biochemical methods. 

15 In addition, those skilled in the art of drug discovery and development 

readily understand that methods for dereplication (e.g., taxonomic dereplication, 
biological dereplication, and chemical dereplication, or any combination thereof) or 
the elimination of replicates or repeats of materials already known for their activities 
should be employed whenever possible. 

20 When a crude extract is found to modulate (i.e., stimulate or inhibit) the 

expression and/or activity of the nucleic acids and or polypeptides of the present 
invention, further fractionation of the positive lead extract is necessary to isolate 
chemical constituents responsible for the observed effect. Thus, the goal of the 
extraction, fractionation, and purification process is the careful characterization and 

25 identification of a chemical entity within the crude extract having an activity that 
stimulates or inhibits nucleic acid expression, polypeptide expression, or polypeptide 
biological activity. The same assays described herein for the detection of activities 
in mixtures of compounds can be used to purify the active component and to test 
derivatives thereof. Methods of fractionation and purification of such heterogenous 

30 extracts are known in the art. If desired, compounds shown to be useful agents for 
treatment are chemically modified according to methods known in the art. 
Compounds identified as being of therapeutic value may be subsequently analyzed 
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using animal models for diseases in which it is desirable to alter the activity or 
expression of the nucleic acids or polypeptides of the present invention. 

In one embodiment, to identify candidate compounds that alter the biological 
activity, for example, the enzymatic activity or transcriptional repression activity of 
5 an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide, a cell, tissue, cell lysate, tissue lysate, or solution containing or 
expressing an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide (e.g., SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SE 
ID NO: 8, SEQ ID NO: 10, or another variant encoded by HDAC9, HDAC9a, 
10 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)), or a fragment or derivative 
thereof (as described above), can be contacted with a candidate compound to be 
tested under conditions suitable for enzymatic reaction or transcriptional repression 
reaction, as described herein. 

Alternatively, the polypeptide can be contacted directly with the candidate 
compound to be tested. The level (amount) of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) biological activity is assessed (e.g., the level 
(amount) of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) biological activity is measured, either directly or indirectly), and is 
compared with the level of biological activity in a control {i.e., the level of activity 
of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide or active fragment or derivative thereof in the absence of the candidate 
compound to be tested, or in the presence of the candidate compound vehicle only). 
If the level of the biological activity in the presence of the candidate compound 
differs, by an amount that is statistically significant, from the level of the biological 
25 activity in the absence of the candidate compound, or in the presence of the 

candidate compound vehicle only, then the candidate compound is a compound that 
alters the biological activity of an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide. For example, an increase in the 
level of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
30 enzymatic or transcriptional repression activity relative to a control, indicates that 
the candidate compound is a compound that enhances (is an agonist of) HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) activity. Similarly, 
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a decrease in the enzymatic level or transcriptional repression level of HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) activity relative to a 
control, indicates that the candidate compound is a compound that inhibits (is an 
antagonist of) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
5 HDRP(ANLS) activity. In another embodiment, the level of biological activity of an 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide or derivative or fragment thereof in the presence of the candidate 
compound to be tested, is compared with a control level that has previously been 
established. A level of the biological activity in the presence of the candidate 

10 compound that differs from the control level by an amount that is statistically 

significant indicates that the compound alters HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) biological activity. 

The present invention also relates to an assay for identifying compounds that 
alter the expression of an HDA C9 f HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or 

15 HDRP(ANLS) nucleic acid molecule (e.g., antisense nucleic acids, fusion proteins, 
polypeptides, peptidomimetics, prodrugs, receptors, binding agents, antibodies, small 
molecules or other drugs, or ribozymes) that alter (e.g., increase or decrease) 
expression (e.g., transcription or translation) of the nucleic acid molecule or that 
otherwise interact with the nucleic acids described herein, as well as compounds 

20 identifiable by the assays. For example, a solution containing a nucleic acid 
encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide can be contacted with a candidate compound to be tested. 
The solution can comprise, for example, cells containing the nucleic acid or cell 
lysate containing the nucleic acid; alternatively, the solution can be another solution 

25 that comprises elements necessary for transcription/translation of the nucleic acid. 
Cells not suspended in solution can also be employed, if desired. The level and/or 
pattern of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
expression (e.g., the level and/or pattern of mRNA or of protein expressed, such as 
the level and/or pattern of different variants) is assessed, and is compared with the 

30 level and/or pattern of expression in a control (i.e., the level and/or pattern of 

HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) expression in 
the absence of the candidate compound, or in the presence of the candidate t 
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compound vehicle only). If the level and/or pattern in the presence of the candidate 
compound differs, by an amount or in a manner that is statistically significant, from 
the level and/or pattern in the absence of the candidate compound, or in the presence 
of the candidate compound vehicle only, then the candidate compound is a 
5 compound that alters the expression of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(dNLS), oxHDRP(ANLS). Enhancement of HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDKPfANLS) expression indicates that the 
candidate compound is an agonist of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDKP(ANLS) activity. Similarly, inhibition of HDAC9, 
10 HDAC9a, HDAC9(ANL S)) HDAC9a(ANLS), or HDRP(MLS) expression indicates 
that the candidate compound is an antagonist of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) activity, hi another embodiment, the level 
and/or pattern of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide^) (e.g., different variants) in the presence of the 
1 5 candidate compound to be tested, is compared with a control level and/or pattern that 
has previously been established. A level and/or pattern in the presence of the 
candidate compound that differs from the control level and/or pattern by an amount 
or in a manner that is statistically significant indicates that the candidate compound 
alters HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
20 expression. 

In another embodiment of the invention, compounds that alter the expression 
of an HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) 
nucleic acid molecule or that otherwise interact with the nucleic acids described 
herein, can be identified using a cell, cell lysate, or solution containing a nucleic 
acid encoding the promoter region of the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) gene operably linked to a reporter gene. After 
contact with a candidate compound to be tested, the level of expression of the 
reporter gene (e.g., the level of mRNA or of protein expressed) is assessed, and is 
compared with the level of expression in a control (i.e., the level of the expression 
30 of the reporter gene in the absence of the candidate compound, or in the presence of 
the candidate compound vehicle only). If the level in the presence of the candidate 
compound differs, by an amount or in a manner that is statistically significant, from 
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the level in the absence of the candidate compound, or in the presence of the 
candidate compound vehicle only, then the candidate compound is a compound that 
alters the expression of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS), as indicated by its ability to alter expression of a gene that is 
5 operably linked to the HDAC9, HDAC9a 9 HDAC9(ANLS) 9 HDAC9a(ANLS), or 
HDRP(ANLS) gene promoter. Enhancement of the expression of the reporter 
indicates that the compound is an agonist of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) activity. Similarly, inhibition of the expression 
of the reporter indicates that the compound is an antagonist of HDAC9, HDAC9a, 

10 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) activity. In another 

embodiment, the level of expression of the reporter in the presence of the candidate 
compound to be tested, is compared with a control level that has previously been 
established. A level in the presence of the candidate compound that differs from the 
control level by an amount or in a manner that is statistically significant indicates 

15 that the candidate compound alters HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) expression. 

Compounds that alter the amounts of different variants encoded by HDAC9, 
HDAC9a, HDA C9(ANLS) > HDAC9a(ANLS), or HDRP(ANLS) (e.g., a compound 
that enhances activity of a first variant, and that inhibits activity of a second variant), 

20 as well as compounds that are agonists of activity of a first variant and antagonists 
of activity of a second variant, can easily be identified using these methods 
described above. 

hi other embodiments of the invention, assays can be used to assess the 
impact of a candidate compound on the activity of a polypeptide in relation to an 

25 HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate, 
for example, an inhibitor of histone deacetylase activity. These inhibitors fall into 
four general classes: 1) short-chain fatty acids (e.g., 4-phenylbutyrate and valproic 
acid); 2) hydroxamic acids (e.g., SAHA, Pyroxamide, trichostatin A (TSA), 
oxamflatin and CHAPs, such as, CHAP1 and CHAP 31); 3) cyclic tetrapeptides 

30 (Trapoxin A, Apicidin and Depsipeptide (FK-228, also known as FR901 1228); 4) 
benzamides (e.g., MS-275); and other compounds such as Scriptaid. Examples of 
such assays and compounds can be found in U.S. Patent Nos. 5,369,108, issued on 
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November 29, 1994, 5,700,81 1, issued on December 23, 1997, and 5,773,474, 
issued on June 30, 1998 to Breslow et al, U.S. Patent Nos. 5,055,608, issued on 
October 8, 1991, and 5,175,191, issued on December 29, 1992 to Marks et al, as 
well as, Yoshida et al, supra; Saito et al, supra; Furamai et al, supra; Komatsu et 
5 al , supra; Su et al, supra; Lee et al, supra and Suzuki et al. supra, the entire 
content of all of which are hereby incorporated by reference. 

In one example, a cell or tissue that expresses or contains a compound that 
interacts with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) (herein referred to as an "HDAC9, HDAC9a, HDAC9(ANLS), 
10 HDAC9a(ANLS), or HDRP(ANLS) substrate," which can be a polypeptide or other 
molecule that interacts with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), 
or HDRP(ANLS)) is contacted with HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) in the presence of a candidate compound, and 
the ability of the candidate compound to alter the interaction between HDAC9, 
1 5 HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) and the HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP (ANLS) substrate is 
determined, for example, by assaying activity of the polypeptide. Alternatively, a 
cell lysate or a solution containing the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) substrate, can be used. A compound that binds 
20 to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or the 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate 
can alter the interaction by interfering with, or enhancing the ability of HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) to bind to, associate 
with, or otherwise interact with the HDAC9, HDAC9a, HDAC9(ANLS), 
25 HDAC9a(ANLS), or HDRP(ANLS) substrate. 

Detennining the ability of the candidate compound to bind to HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or an HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), orHDRP(ANLS) substrate can be 
accomplished, for example, by coupling the candidate compound with a 
30 radioisotope or enzymatic label such that binding of the candidate compound to the 
polypeptide can be determined by detecting the labeled with ,2S L 35 S, 14 C, or 3 H, 
either directly or indirectly, and the radioisotope detected by direct counting of 
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radioemmission or by scintillation counting. Alternatively, candidate compound can 
be enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase, and the enzymatic label detected by determination of 
conversion of an appropriate substrate to product. 
5 It is also within the scope of this invention to determine the ability of a 

candidate compound to interact with the polypeptide without the labeling of any of 
the interactants. For example, a microphysiometer can be used to detect the 
interaction of a candidate compound with HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) or an HDAC9, HDAC9a, HDAC9(ANLS), 

1 0 HDAC9a(ANLS), or HDRP(ANLS) substrate without the labeling of either the 
candidate compound, HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS), or the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) substrate (McConnell et al. 9 (1992) Science, 257: 1906-1912). As 
used herein, a "microphysiometer" (e.g., CYTOSENSOR™) is an analytical 

15 instrument that measures the rate at which a cell acidifies its environment using a 
light-addressable potentiometric sensor (LAPS). Changes in this acidification rate 
can be used as an indicator of the interaction between ligand and polypeptide. 

In another embodiment of the invention, assays can be used to identify 
polypeptides that interact with one or more HDAC9, HDAC9a, HDAC9(ANLS), 

20 HDAC9a(ANLS), or HDRP(ANLS) polypeptides, as described herein. For example, 
a yeast two-hybrid system such as that described by Fields and Song (Fields and 
Song, Nature 340: 245-246 (1989)) can be used to identify polypeptides that interact 
with one or more HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptides. In such a yeast two-hybrid system, vectors are 

25 constructed based on the flexibility of a transcription factor that has two functional 
domains (a DNA binding domain and a transcription activation domain). If the two 
domains are separated but fused to two different proteins that interact with one 
another, transcriptional activation can be achieved, and transcription of specific 
markers (e.g., nutritional markers such as His and Ade, or color markers such as 

30 lacZ) can be used to identify the presence of interaction and transcriptional 

activation. For example, in the methods of the invention, a first vector is used that 
includes a nucleic acid encoding a DNA binding domain and an HDAC9, HDAC9a, 
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HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, variant, or 
fragment or derivative thereof, and a second vector is used that includes a nucleic 
acid encoding a transcription activation domain and a nucleic acid encoding a 
polypeptide that potentially may interact with the HDAC9, HDAC9a, 
5 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, variant, or 
fragment or derivative thereof (eg., an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide substrate or receptor). Incubation 
of yeast containing the first vector and the second vector under appropriate 
conditions (e.g., mating conditions such as used in the MATCHMAKER™ system 

1 0 from Clontech) allows identification of colonies that express the markers of 

HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). These 
colonies can be examined to identify the polypeptide^) that interact with the 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide or fragment or derivative thereof. Such polypeptides may be useful as 

15 compounds that alter the activity or expression of an HDAC9, HDAC9a, 

HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, as described 
above. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize an HDAC9, HDAC9a, 

20 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate, or other 
components of the assay on a solid support, in order to facilitate separation of 
complexed from uncomplexed forms of one or both of the polypeptides, as well as 
to accommodate automation of the assay. Binding of a candidate compound to the 

25 polypeptide, or interaction of the polypeptide with a substrate in the presence and 
absence of a candidate compound, can be accomplished in any vessel suitable for 
containing the reactants. Examples of such vessels include microtitre plates, test 
tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein (e.g., a 
glutathione-S-transferase fusion protein) can be provided that adds a domain that 
30 allows HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or 
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
substrate to be bound to a matrix or other solid support. 
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In another embodiment, modulators of expression of nucleic acid molecules 
of the invention are identified in a method wherein a cell, cell lysate, tissue, tissue 
lysate, or solution containing a nucleic acid encoding HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is contacted with a candidate 
5 compound and the expression of appropriate mRNA or polypeptide (e.g., variant(s)) 
in the cell, cell lysate, tissue, or tissue lysate, or solution, is determined. The level 
of expression of appropriate mRNA or polypeptide(s) in the presence of the 
candidate compound is compared to the level of expression of mRNA or 
polypeptide(s) in the absence of the candidate compound, or in the presence of the 

1 0 candidate compound vehicle only. The candidate compound can then be identified 
as a modulator of expression based on this comparison. For example, when 
expression of mRNA or polypeptide is greater (statistically significantly greater) in 
the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator or enhancer of the mRNA or polypeptide 

15 expression. Alternatively, when expression of the mRNA or polypeptide is less 

(statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of the mRNA or 
polypeptide expression. The level of mRNA or polypeptide expression in the cells 
can be determined by methods described herein for detecting mRNA or polypeptide. 

20 This invention further pertains to novel compounds identified by the 

above-described screening assays. Accordingly, it is within the scope of this 
invention to further use a compound identified as described herein in an appropriate 
animal model. For example, a compound identified as described herein (e.g., a 
candidate compound that is a modulating compound such as an antisense nucleic 

25 acid molecule, a specific antibody, or a polypeptide substrate) can be used in an 
animal model to determine the efficacy, toxicity, or side effects of treatment with 
such a compound. Alternatively, a compound identified as described herein can be 
used in an animal model to determine the mechanism of action of such a compound. 
Furthermore, this invention pertains to uses of novel compounds identified by the 

30 above-described screening assays for treatments as described herein. In addition, a 
compound identified as described herein can be used to alter activity of an HDAC9, 
HDAC9a, HDAC9(ANLS), KDAC9a(ANLS), or HDRP(ANLS) polypeptide, or to 
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5 



alter expression of HDAC9, HDACPa, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS), by contacting the polypeptide or the nucleic acid molecule (or 
contacting a cell comprising the polypeptide or the nucleic acid molecule) with the 
compound identified as described herein. 



PHARMACEUTICAL COMPOSITIONS 

The present invention also pertains to pharmaceutical compositions 
comprising nucleic acids described herein, particularly nucleotides encoding the 
polypeptides described herein; comprising polypeptides described herein {e.g., SEQ 
10 ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO:10, and/or 
other variants encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS)); and/or comprising a compound that alters (e.g., increases or 
decreases) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), oxHDRP(ANLS) 
expression or HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
1 5 HDRP(ANLS) polypeptide activity as described herein. For instance, a polypeptide, 
protein, fragment, fusion protein or prodrug thereof, or a nucleotide or nucleic acid 
construct (vector) comprising a nucleotide of the present invention, a compound that 
alters HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide activity, a compound that alters HDAC9, HDAC9a, HDAC9(ANLS), 
20 HDACPaCANLSXormm^LSJmcMcscidex^^o^oTmHDAC^ 

HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate or binding 
partner, can be formulated with a physiologically acceptable carrier or excipient to 
prepare a pharmaceutical composition. The carrier and composition can be sterile. 
The formulation should suit the mode of administration. 
25 Suitable pharmaceutical^ acceptable carriers include but are not limited to 

water, salt solutions (e.g., NaCl), saline, buffered saline, alcohols, glycerol, ethanoL 
gum arable, vegetable oils, benzyl alcohols, polyethylene glycols, gelatin, 
carbohydrates such as lactose, amylose or starch, dextrose, magnesium stearate, talc, 
silicic acid, viscous paraffin, perfume oil, fatty acid esters, hydroxymethylcellulose,' 
30 polyvinyl pyrolidone, etc., as well as combinations thereof. The pharmaceutical 
preparations can, if desired, be mixed with auxiliary agents, e.g., lubricants, 
preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic 
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pressure, buffers, coloring, flavoring and/or aromatic substances and the like that do 
not deleteriously react with the active compounds. 

The composition, if desired, can also contain minor amounts of wetting or 
emulsifying agents, or pH buffering agents. The composition can be a liquid 
5 solution, suspension, emulsion, tablet, pill, capsule, sustained release formulation, 
or powder. The composition can be formulated as a suppository, with traditional 
binders and carriers such as triglycerides. Oral formulation can include standard 
carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium 
stearate, polyvinyl pyrollidone, sodium saccharine, cellulose, magnesium carbonate, 
10 etc. 

Methods of introduction of these compositions include, but are not limited 
to, intradermal, intramuscular, intraperitoneal, intraocular, intravenous, 
subcutaneous, topical, oral and intranasal. Other suitable methods of introduction 
can also include gene therapy (as described below), rechargeable or biodegradable 

15 devices, particle acceleration devises ("gene guns") and slow release polymeric 
devices. The pharmaceutical compositions of this invention can also be 
administered as part of a combinatorial therapy with other compounds. 

The composition can be formulated in accordance with the routine 
procedures as a pharmaceutical composition adapted for administration to human 

20 beings. For example, compositions for intravenous administration typically are 

solutions in sterile isotonic aqueous buffer. Where necessary, the composition may 
i also include a solubilizing agent and a local anesthetic to ease pain at the site of the 
injection. Generally, the ingredients are supplied either separately or mixed together 
in unit dosage form, for example, as a dry lyophilized powder or water free 

25 concentrate in a hermetically sealed container such as an ampule or sachette 
indicating the quantity of active compound. Where the composition is to be 
administered by infusion, it can be dispensed with an infusion bottle containing 
sterile pharmaceutical grade water, saline or dextrose/water. Where the composition 
is administered by injection, an ampule of sterile water for injection or saline can be 

30 provided so that the ingredients may be mixed prior to administration. 

For topical application, nonsprayable forms, viscous to semi-solid or solid 
forms comprising a carrier compatible with topical application and having a 
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15 



20 



25 



30 



dynamic viscosity preferably greater than water, can be employed. Suitable 
formulations include but are not limited to solutions, suspensions, emulsions, 
creams, ointments, powders, enemas, lotions, sols, liniments, salves, aerosols, etc., 
that are, if desired, sterilized or mixed with auxiliary agents, e.g., preservatives, 
5 stabilizers, wetting agents, buffers or salts for influencing osmotic pressure, efo 
The compound maybe incorporated into a cosmetic formulation. For topical 
application, also suitable are sprayable aerosol preparations wherein the active 
ingredient, preferably in combination with a solid or liquid inert carrier material, is 
packaged in a squeeze bottle or in admixture with a pressurized volatile, normally 
10 gaseous propellant, e.g., pressurized air. 

Compounds described herein can be formulated as neutral or salt forms. 
Pharmaceutical^ acceptable salts include those formed with free amino groups such 
as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., 
and those formed with free carboxyl groups such as those derived from sodium, ' 
potassium, ammonium, calcium, ferric hydroxides, isopropylamine, triethylamine, 
2-ethylamino ethanol, histidine, procaine, etc. 

The compounds are administered in a therapeutically effective amount. The 
amount of compounds that will be therapeutically effective in the treatment of a 
particular disorder or condition will depend on the nature of the disorder or 
condition, and can be determined by standard clinical techniques. In addition, in 
vitro or in vivo assays may optionally be employed to help identify optimal dosage 
ranges. The precise dose to be employed in the formulation will also depend on the 
route of admirustration, and the seriousness of the symptoms of a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease, and should be decided 
according to the judgment of a practitioner and each patient's circumstances. 
Effective doses may be extrapolated from dose-response curves derived from in 
vitro or animal model test systems. 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Optionally associated with such containers) can be 
a notice in the form prescribed by a governmental agency regulating the 
manufacture, use or sale of pharmaceuticals or biological products, that notice 
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reflects approval by the agency of manufacture, use of sale for human 
administration. The pack or kit can be labeled with information regarding mode of 
administration, sequence of drug administration (e.g., separately, sequentially or 
concurrently), or the like. The pack or kit may also include means for reminding the 
5 patient to take the therapy. The pack or kit can be a single unit dosage of the 
combination therapy or it can be a plurality of unit dosages. In particular, the 
compounds can be separated, mixed together in any combination, present in a single 
vial or tablet. Compounds assembled in a blister pack or other dispensing means is 
preferred. For the purpose of this invention, unit dosage is intended to mean a 
10 dosage that is dependent on the individual pharmacodynamics of each compound 
and administered in FDA approved dosages in standard time courses. 

METHODS OF THERAPY 

The present invention also pertains to methods of treatment (prophylactic, 

15 diagnostic, and/or therapeutic) for a cell proliferation disease, an apoptotic disease, 
or a cell differentiation disease, using an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound. An "HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) therapeutic 
compound" is a compound that alters (e.g., enhances or inhibits) HDAC9, HDAC9a, 

20 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity and/or 
HDAC9, HDAC9a 9 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid 
molecule expression, as described herein (e.g., an HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) agonist or antagonist). 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 

25 therapeutic compounds can alter HDAC9, HDAC9a, HDAC9(ANLS), 

HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity or nucleic acid molecule 
expression by a variety of means, such as, for example, by providing additional 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide or by upregulating the transcription or translation of the HDAC9, 

30 HDAC9a> HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid 
molecule; by altering post-translational processing of the HDAC9, HDAC9a, 
HDAC9(ANLS), HD AC9a(ANLS), or HDRP(ANLS) polypeptide; by altering 
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transcription of HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or 
HDRP(ANLS) variants; or by interfering with HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity (e.g., by binding to an 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
5 polypeptide), or by downregulating the transcription or translation of the HDAC9, 
HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) nucleic acid 
molecule. Representative HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), 
or HDRP(ANLS) therapeutic compounds include the following: nucleic acids or 
fragments or derivatives thereof described herein, particularly nucleotides encoding 
10 the polypeptides described herein and vectors comprising such nucleic acids (e.g., a 
nucleic acid molecule, cDNA and/or RNA, such as a nucleic acid encoding an 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide or active fragment or derivative thereof, or an oligonucleotide; for 
example, SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID 
15 NO: 9, which may optionally comprise at least one polymorphism, or a nucleic acid 
encoding SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID 
NO: 10, or fragments or derivatives thereof); polypeptides described herein (e.g., 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8 SEQ ID NO: 10 ' 
and/or other variants encoded by HDAC9, HDAC9a, HDAC9(ANLS), 
:0 HDAC9a(ANLS), or HDRP(ANLS), or fragments or derivatives thereof); HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrates; 
peptidomimetics; fusion proteins or prodrugs thereof; antibodies (e.g., an antibody 
to a mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide, or an antibody to a non-mutant HDAC9, HDAC9a, 
5 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an antibody to 
a particular variant encoded by HDAC9, HDAC9a, HDAC9(ANLS), 
HDA C9a(ANLS), or HDRP(ANLS), as described above); ribozymes; other small 
molecules; and other compounds that alter (e.g., enhance or inhibit) HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a( 6NLS), oxHDRP(ANLS) nucleic acid 
expression or polypeptide activity, for example, those compounds identified in the 
screening methods described herein, or that regulate transcription of HDAC9, 
HDAC9a, HDAC9(ANLS), HDA C9a( ANLS), or HDRP(ANLS) variants (e.g., 
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compounds that affect which variants are expressed, or that affect the amount of 
each variant that is expressed. More than one HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound can be used 
concurrently, if desired. 
5 The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 

HDRP(ANLS) therapeutic compound that is a nucleic acid is used in the treatment 
of a cell proliferation disease, an apoptotic disease, or a cell differentiation disease. 
The term, "treatment" as used herein, refers not only to ameliorating symptoms 
associated with the disease, but also preventing or delaying the onset of the disease, 

10 and also lessening the severity or frequency of symptoms of the disease. The 

therapy is designed to alter (e.g., inhibit or enhance), replace or supplement activity 
of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide in an individual. For example, an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound can be administered in 

1 5 order to upregulate or increase the expression or availability of the HDAC9, 

HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule 
or of specific variants of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS), or, conversely, to downregulate or decrease the expression or 
availability of the HDAC9, HDAC9a 9 HDAC9(ANLS) 9 HDA C9a(ANLS) 9 or 

20 HDRP(ANLS) nucleic acid molecule or specific variants of HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). Upregulation or increasing 
expression or availability of a native HDAC9, HDAC9a, HDAC9(ANLS), 
HDA C9a( ANLS), or HDRP(ANLS) nucleic acid molecule or of a particular variant 
could interfere with or compensate for the expression or activity of a defective gene 

25 or another variant; downregulation or decreasing expression or availability of a 
native HDAC9, HDAC9a, HDAC9(ANLS) ) HDAC9a(ANLS), or HDRP(ANLS) 
nucleic acid molecule or of a particular variant could minimize the expression or 
activity of a defective gene or the particular variant and thereby minimize the impact 
of the defective gene or the particular variant. 

30 The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 

HDRP(ANLS) therapeutic compound(s) are administered in a therapeutically 
effective amount (z.e., an amount that is sufficient to treat the disease, such as by 
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ameliorating symptoms associated with the disease, preventing or delaying the onset 
of the disease, and/or also lessening the severity or frequency of symptoms of the 
disease). The amount that will be therapeutically effective in the treatment of a 
particular individual's disorder or condition will depend on the symptoms and 
5 severity of the disease, and can be deteimined by standard clinical techniques. In 
addition, in vitro or in vivo assays may optionally be employed to help identify 
optimal dosage ranges. The precise dose to be employed in the formulation will 
also depend on the route of administration, and the seriousness of the disease or 
disorder, and should be decided according to the judgment of a practitioner and each 
10 patient's circumstances. Effective doses may be extrapolated from dose-response 
curves derived from in vitro or animal model test systems. 

In one embodiment, a nucleic acid of the invention (e.g., a nucleic acid 
encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide, such as SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, 
SEQ ID NO: 7, or SEQ ID NO: 9, which may optionally comprise at least one 
polymorphism, or a nucleic acid that encodes an HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide or a variant, 
derivative or fragment thereof such as a nucleic acid encoding the protein of SEQ 
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10) can 
be used, either alone or in a pharmaceutical composition as described above. For 
example, HDAC9, HDAC9a, HDA C9(ANLS), HDA C9a(ANLS), orHDRP(ANLS) or 
a cDNA encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide, either by itself or included within a vector, can be 
introduced into cells (either in vitro or in vivo) such that the cells produce native 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide. If desired, cells that have been transformed with the gene or cDNA or 
a vector comprising the gene or cDNA can be introduced (or re-introduced) into an 
individual affected with the disease. Thus, cells that, in nature, lack native HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression and 
activity, or have mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) expression and activity, or have expression of a disease-associated 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) variant, 
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can be engineered to express an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide or an active fragment of an 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide (or a different variant of an HDAC9, HDAC9a, HDAC9(ANLS), 
5 HDAC9a(ANLS), or HDRP(ANLS) polypeptide). In a preferred embodiment, 

nucleic acid encoding the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide, or an active fragment or derivative thereof, can be 
introduced into an expression vector, such as a viral vector, and the vector can be 
introduced into appropriate cells in an animal. Other gene transfer systems, 
1 0 including viral and nonviral transfer systems, can be used. Alternatively, nonviral 
gene transfer methods, such as calcium phosphate coprecipitation, mechanical 
techniques (e.g., microinjection); membrane fusion-mediated transfer via liposomes; 
or direct DNA uptake, can also be used to introduce the desired nucleic acid 
molecule into a cell. 

1 5 Alternatively, in another embodiment of the invention, a nucleic acid of the 

invention; a nucleic acid complementary to a nucleic acid of the invention; or a 
portion of such a nucleic acid (e.g, an oligonucleotide as described below), can be 
used in "antisense" therapy, in which a nucleic acid (e.g., an oligonucleotide) that 
specifically hybridizes to the RNA and/or genomic DNA of HDAC9, HDAC9a, 

20 HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is administered or generated in 
situ. The antisense nucleic acid that specifically hybridizes to the RNA and/or DNA 
inhibits expression of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) nucleic acid molecule, e.g., by inhibiting translation and/or 
transcription. Binding of the antisense nucleic acid can be by conventional base pair 

25 complementarity, or, for example, in the case of binding to DNA duplexes, through 
specific interaction in the major groove of the double helix. 

An antisense construct of the present invention can be delivered, for 
example, as an expression plasmid as described above. When the plasmid is 
transcribed in the cell, it produces RNA that is complementary to a portion of the 

30 mRNA and/or DNA that encodes an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide. Alternatively, the antisense 
construct can be an oligonucleotide probe which is generated ex vivo and introduced 
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into cells; it then inhibits expression by hybridizing with the mRNA and/or genomic 
DNAof HDAC9,HDAC9a,HDAC9(ANLS),HDAC9a(ANLS), or HDRP(ANLS). In 
one embodiment, the oligonucleotide probes are modified oligonucleotides that are 
resistant to endogenous nucleases, e.g. exonucleases and/or endonucleases, thereby 
5 rendering them stable in vivo. Exemplary nucleic acid molecules for use as 
antisense oligonucleotides are phosphoramidate, phosphothioate and 
methylphosphonate analogs of DNA (see also U.S. Patent Nos. 5,176,996; 
5,264,564; and 5,256,775). Additionally, general approaches to constructing 
oligomers useful in antisense therapy are also described, for example, by Van der 
10 Krol et ah, Biotechniques 6: 958-976 (1988); and Stein et al, Cancer Res 48: 
2659-2668 (1988). With respect to antisense DNA, oligodeoxyribonucleotides 
derived from the translation initiation site, e.g. between the -10 and +10 regions of 
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic 
acid sequence, are preferred. 

To perform antisense therapy, oligonucleotides (RNA, cDNA or DNA) are 
designed that are complementary to mRNA encoding an HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide. The antisense 
oligonucleotides bind to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) mRNA transcripts and prevent translation. Absolute 
complementarity, although preferred, is not required. A sequence "complementary" 
to a portion of an RNA, as referred to herein, indicates that a sequence has sufficient 
complementarity to be able to hybridize with the RNA forming a stable duplex; in 
the case of double-stranded antisense nucleic acids, a single strand of the duplex 
DNA may thus be tested, or triplex formation may be assayed. The ability to 
hybridize will depend on both the degree of complementarity and the length of the 
antisense nucleic acid, as described in detail above. Generally, the longer the 
hybridizing nucleic acid, the more base mismatches with an RNA it may contain and 
still form a stable duplex (or triplex, as the case may be). One skilled in the art can 
ascertain a tolerable degree of mismatch by use of standard procedures. 

The oligonucleotides used in antisense therapy can be DNA, RNA, or 
chimeric mixtures or derivatives or modified versions thereof, single-stranded or 
double-stranded. The oligonucleotides can be modified at the base moiety, sugar 
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moiety, or phosphate backbone, for example, to improve stability of the molecule, 
hybridization, etc. The oligonucleotides can include other appended groups such as 
peptides {e.g. for targeting host cell receptors in vivo), or compounds facilitating 
transport across the cell membrane (see, e.g., Letsinger et al. 9 Proc. Natl. Acad. Sci. 
5 USA 86: 6553-6556 (1989); Lemaitre et al, Proc. Natl. Acad Sci. USA 84: 648-652 
(1987); PCT International Publication No. W088/09810)) or the blood-brain barrier 
(see, e.g., PCT International Publication No. W089/10134), or 
hybridization-triggered cleavage agents (see, e.g., Krol et aL, BioTechniques 6: 
958-976 (1988)) or intercalating agents. (See, e.g., Zon, Pharm. Res. 5: 539-549 

10 (1988)). To this end, the oligonucleotide may be conjugated to another molecule 
{e.g., a peptide, hybridization triggered cross-linking agent, transport agent, 
hybridization-triggered cleavage agent). 

The antisense molecules are delivered to cells that express HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in vivo. A number of 

15 methods can be used for delivering antisense DNA or RNA to cells; e.g., antisense 
molecules can be injected directly into the tissue site, or modified antisense 
molecules, designed to target the desired cells {e.g., antisense linked to peptides or 
antibodies that specifically bind receptors or antigens expressed on the target cell 
surface) can be administered systematically. Alternatively, in a preferred 

20 embodiment, a recombinant DNA construct is utilized in which the antisense 

oligonucleotide is placed under the control of a strong promoter {e.g., pol in or pol 
IT). The use of such a construct to transfect target cells in the patient results in the 
transcription of sufficient amounts of single stranded RNAs that will form 
complementary base pairs with the endogenous HDAC9, HDAC9a, HDAC9(ANLS), 

25 HDAC9a(ANLS) 9 or HDRP(ANLS) transcripts and thereby prevent translation of the 
HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) mRNA. 
For example, a vector can be introduced in vivo such that it is taken up by a cell and 
directs the transcription of an antisense RNA. Such a vector can remain episomal or 
become chromosomally integrated, as long as it can be transcribed to produce the 

30 desired antisense RNA. Such vectors can be constructed by recombinant DNA 
technology methods standard in the art and described above. For example, a 
plasmid, cosmid, YAC, or viral vector can be used to prepare the recombinant DNA 
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construct that can be introduced directly into the tissue site. Alternatively, viral 
vectors can be used that selectively infect the desired tissue, in which case 
a<hninistration may be accomplished by another route (e.g., systematically). 

Endogenous HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or . 
5 HDRP(ANLS) expression can also be reduced by inactivating or "knocking out" 
HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) nucleic acid 
sequences or their promoters using targeted homologous recombination (e.g., see 
Smithies et al, Nature 317: 230-234 (1985); Thomas and Capecchi, Cell 51: 
503-512 (1987); Thompson etal., Cell 5: 313-321 (1989)). For example, a mutant, 
10 non-functional HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) (or a completely unrelated DNA sequence) flanked by DNA 
homologous to the endogenous HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) (either the coding regions or regulatory regions 
of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(AMLS)) can be 
1 5 used, with or without a selectable marker and/or a negative selectable marker, to 
transfect cells that express HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or 
HDRP(ANLS) in vivo. Insertion of the DNA construct, via targeted homologous 
recombination, results in inactivation of HDAC9, HDAC9a, HDA C9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS). The recombinant DNA constructs can be 
directly administered or targeted to the required site in vivo using appropriate 
vectors, as described above. Alternatively, expression of non-mutant HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), oxHDRP(ANLS) can be increased 
using a similar method: Targeted homologous recombination can be used to insert a 
DNA construct comprising a non-mutant, functional HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(MLS) (e.g., a gene having SEQ ID 
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9, which 
may optionally comprise at least one polymorphism), or a portion thereof, in place 
of a mutant HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) 
in the cell, as described above. In another embodiment, targeted homologous 
recombination can be used to insert a DNA construct comprising a nucleic acid that 
encodes an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide variant that differs from that present in the cell. 
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Alternatively, endogenous HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(&NLS), or HDRP(ANLS) expression can be reduced by targeting 
deoxyribonucleotide sequences complementary to the regulatory region of HDAC9, 
HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) (z.e. , the HDAC9, 
5 HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) promoter and/or 
enhancers) to form triple helical structures that prevent transcription of HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in target cells in the 
body. (See generally, Helene Anticancer Drug Des., 6(6): 569-84 (1991); Helene et 
al, Ann, N.Y. Acad. Sci., 660: 27-36 (1992); andMaher, Bioassays 14(12): 807-15 

10 (1992)). Likewise, the antisense constructs described herein, by antagonizing the 
normal biological activity of one of the HDAC9, HDAC9a, HDAC9(ANLS), 
HD AC9a(ANLS), or HDRP(ANLS) proteins, can be used in the manipulation of 
tissue, e.g., tissue differentiation, both in vivo and for ex vivo tissue cultures. 
Furthermore, the antisense techniques (e.g., microinjection of antisense molecules, 

15 or transfection with plasmids whose transcripts are anti-sense with regard to an 
HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or HDRP(ANLS) mRNA or 
gene sequence) can be used to investigate role of HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in developmental events, as 
well as the normal cellular function of HDAC9, HDAC9a, HDAC9(ANLS), 

20 HDAC9a(ANLS), or HDRP(ANLS) in adult tissue. Such techniques can be utilized 
in cell culture, but can also be used in the creation of transgenic animals. 

In yet another embodiment of the invention, other HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) therapeutic compounds as 
described herein can also be used in the treatment or prevention of a cell 

25 proliferation disease, an apoptotic disease, or a cell differentiation disease. The 

therapeutic compounds can be delivered in a composition, as described above, or by 
themselves. They can be administered systemically, or can be targeted to a 
particular tissue. The therapeutic compounds can be produced by a variety of 
means, including chemical synthesis; recombinant production; in vivo production 

30 (e.g., a transgenic animal, such as U.S. Patent No. 4,873,316 to Meade et al), for 
example, and can be isolated using standard means such as those described herein. 
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A combination of any of the above methods of treatment {e.g., 
administration of non-mutant HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDKP(ANLS) polypeptide in conjunction with antisense 
therapy targeting mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
5 HDRP(ANLS) mRNA; administration of a first variant encoded by HDAC9, 

HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), oxHDRP(ANLS) in conjunction with 
antisense therapy targeting a second encoded by HDA C9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS), can also be used. 

In another embodiment, the invention is directed to HDAC9, HDAC9a, 

10 HDAC9(ANLS), HDAC9a(ANLS), or HDKP(ANLS) nucleic acid molecules and 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptides for use as a medicament in therapy. For example, the nucleic acid 
molecules or polypeptides of the present invention can be used in the treatment of a 
cell proliferation disease, an apoptotic disease, or a cell differentiation disease. In 

15 addition, the HDAC9, HDAC9a, HDAC9(ANLS), HDA C9a(ANLS), or 

HDRP(ANLS) nucleic acid molecules and HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptides described herein can be used in 
the manufacture of a medicament for the treatment of a cell proliferation disease, an 
apoptotic disease, or a cell differentiation disease. 

20 The invention will be further described by the following non-limiting 

examples. The teachings of all publications cited herein are incorporated herein by 
reference in their entirety. 



EXEMPLIFICATION 
25 Cloning ofcDNA encodes a novel HDAC, designated HDAC9 

HDAC9 was cloned by PCR and 3' rapid amplification of cDNA ends using 
primers designed from the sequence of human chromosome 7 whose translated 
product exhibited 80% identity to the HDAC domain of HDAC4, described in detail 
as follows. 

30 Database analyses indicate that HDRP is located on chromosome 7 (7pl 5- 

p21). The human genome database (February 2001 release) of GenBank was 
searched using the human HDAC4 amino acid sequence. The TBLASTN program 
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was used to identify open reading frames downstream of HDRP on chromosome 7 
that exhibit significant homology to the HDAC domain of HDAC4. Several 
fragments whose translated products exhibit over 58% identity were retrieved. Two 
sense primers (OL486, 5'-CCATGGAAACGGTACCCAGCAGGC-3» (SEQ ID NO: 
5 16) and OL487, 5-CACTCCATCGCTATGATGAAGGG-3' (SEQ ID NO: 17)) and 
antisense primers (01484, S'-AGTTCCCTTCATCATAGCGATGG-S 1 (SEQ ID 
NO: 18) and OL485, 5 ! -AATGTACAGGATGCTGGGGT-3' (SEQ ID NO: 19)) 
each were designed based upon one of these fragments whose translated products 
matched amino acids 842-873 of HDAC4. RT-PCR was performed using each of 

10 the antisense primers and a sense primer 

(S'-CCCTTGTAGCTGGTGGAGTTCCCTT-S' (SEQ ID NO: 20)) from the coding 
region of HDRP and human brain cDNA as a template. PCR was performed in a 
Biometra TGRADDENT Thermocycler for 30 cycles at 95°C for 20 seconds, 60°C 
for 20 seconds, and 72°C for 120 seconds. 

1 5 3 '-rapid amplification of cDNA ends was performed using the sense primer 

OL486 and adaptor primer 1 (Clontech), and marathon-ready cDNA from human 
brain (Clontech, Palo Alto, CA) according to the manufacturers instruction. The 
products were re-amplified using nested sense primer OL487 and adaptor primer 2 
(Clontech, Palo Alto, CA). PCR products were cloned into pGEM-T-easy vector 

20 (Promega, Madison, WT) and sequenced using an automated DNA sequencer at the 
DNA Sequencing Core Facility of the Memorial Sloan-Kettering Cancer Center, 
using DNA sequencing methods known to one of skill in the art. 

Two cDNAs were cloned from the above-described methods. One cDNA 
(SEQ ID NO:l) encodes an HDAC9 protein that is 101 1 amino acids in length. The 

25 other cDNA (SEQ ID NO: 3) encodes an HDAC9a protein that is 879 amino acids 
long. The cDNA sequence and amino sequence of HDAC9 and HDAC9a are shown 
in FIGS. 1 A-1G and FIGS. 2A-2B, respectively. Database analyses of these cDNAs 
against human genomic DNA sequences indicated that these two cDNAs are 
generated by alternatively splicing. An alignment of HDAC9, HDAC9a, HDRP, 

30 and HDAC4 is shown in FIGS. 3A-3C. 

Each of the HDAC9 and HDAC9a nucleic acid sequences were cloned into 
the pFLAG-CMV-5b vector (Sigma) in frame with the C-terminal FLAG tag. Only 
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the coding regions plus three extra base pairs (ACQ of cDNA of the HDAC9 and 
HDAC9a nucleic acid sequences were included in the constructs. These constructs 
are referred to herein as HDAC9-FLAG and HDAC9a-FLAG, respectively. These 
constructs are contained in E. coli, and can readily be expressed. For HDAC9, the 
insert is 3033 bp and for HDAC9a, the insert size is 2637 bp. Both HDAC9 and 
HDAC9a can be released with EcoRV and Bamffl (whose sites have been 
incorporated in the primers to obtain HDAC9 and HDAC9a coding cDNA for 
cloning purpose) restriction enzyme digestion. 

The HDAC9 cDNA sequences from the known 5*-end of HDRP cDNA to the 
S'-untranslated region cloned in this study cover over 51 1 kb of genomic DNA on 
chromosome 7. As shown in FIG. 4, the coding region cDNA of HDAC9 resides in 
23 exons spanning 458 kb of genomic sequence. Exons 21, 22, and 23 are one 
single exon in HDAC9a, but the middle exon that is numbered exon 22 in FIG. 4, 
containing an in-frame stop codon, is spliced out in HDAC9. Li addition, exons 12 
and 13 are a single exon used by HDRP. Exon 13 is spliced as part of an intron in 
HDAC9 and HDAC9a. 

Further analysis revealed that exon 7, which contains a nuclear localization 
signal (NLS) is alternatively spliced in an HDRP isoform, creating HDRP(ANLS). 
RT-PCR analyses using primers based on sequences from exon 6 and exon 14 
indicate that this alternative splicing event also occurs in HDAC9 and/or HDAC9a. 
Thus, it is possible that at least 6 proteins can be generated from a single HDAC9 
gene by alternatively splicing of its RNA. The cDNA sequences and amino acid 
sequences for HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and 
HDRP(ANLS) are shown in FIGS. lA-lO and 2A-2E, respectively. 

HDAC9 mRNA is differentially expressed among human tissues 

The expression of HDAC9 mRNA was determined by Northern blot analysis 
using a human multiple tissue Northern blot (Clontech, Palo Alto, CA). 
Hybridization was performed according to the manufacturer's instruction using 
ExPressHyb solution (Clontech, Palo Alto, CA). The 32 P-random priming labeled 
S'-untranslated region common to both HDAC9 and HDAC9a that shares no 
significant sequence homology with HDRP was used as aprobe. Two transcripts at 
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9.8 and 4.1 kb were detected in all tissues examined (FIG. 6A). The 4.1 kb 
transcript is shorter than the 4.4 kb HDRP transcript (See Zhou, et al y Proc. Natl. 
Acad. Sci. USA, 97:1056-1061 (2000)). A third transcript at 1.2 kb was detected in 
placenta (FIG. 6A). Similar to HDRP (See Zhou, X., et al, Proc. Natl. Acad. Sci. 
5 USA, 97:1056-1061 (2000)), high levels of HDAC9 transcripts were detected in 
brain and skeletal muscle (FIG. 6A). 

The distribution of alternatively spliced mRNA variants among tissues was 
examined by RT-PCR using primers (OL516 5-TGTGTCATCGAGCTGGCTTC-3' 
(SEQ ID NO: 21) and OL517 5'-ATCTTCTGCAAGTGGCTCCA-3' (SEQ ID NO: 

10 22)) spanning the alternatively spliced exon 22 and cDNA panel from the same 
tissues as the multiple tissue Northern blot. PCR was performed in a Biometra 
TGRADIENT Thermocycler for 30 cycles at 95°C for 20 seconds, 60°C for 20 
seconds, and 72°C for 60 seconds. The expected sizes of PCR products were 680 
base pairs for HDAC9 and 993 base pairs for HDAC9a. The ratio of HDAC9 and 

15 HDAC9a transcripts differed among tissues (FIG. 6B). In the placenta and kidney, 
the levels of the two transcripts were about the same (FIG. 6B). In the brain, heart, 
and pancreas, there were more transcripts of HDAC9 than HDAC9a. In the other 
tissues examined, there were more HDAC9a transcripts than HDAC9 transcripts 
(FIG. 6B). Under the conditions tested, HDAC9 transcripts were undetectable in 

20 liver (FIG. 6B). The lung had an HDAC9 product that was larger than expected and 
abundant. The lung also had low levels of HDAC9 transcripts and HDAC9a 
transcripts (FIG. 6B). An additional PCR product was also amplified from cDNA of 
the pancreas; this product was than the expected products from HDAC9 and 
HDAC9a (FIG. 6B). The identity of the different sized transcripts is unknown. 

25 

HDAC9 and HDAC9a possess histone deacetylase activity 

HDAC9 was named based on sequence homology to HDAC4 (FIGS. 3A- 

3C). To determine whether HDAC9 and HDAC9a possess HDAC activity, an 

KDAC enzymatic assay was performed using anti-FLAG immunoprecipitated 
30 HDAC9-FLAG and HDAC9a-FLAG. 

C-terminal FLAG-tagged HDAC9 (HDAC9-FLAG) and HDAC9a 

(HDAC9a-FLAG) expression vectors were constructed using the pFLAG-CMV-5b 
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vector (Sigma) and PCR amplified coding regions of HDAC9 and HDAC9a in 
frame with the FLAG-tag to form pFLAG-CMV-5b-HDAC9 (plasmid VRl) and 
P FLAG-CMV-5b-HDAC9a (plasmid VR2). All constructs were confirmed by DNA 
sequencing. 

Transfection of human kidney 293T cells, immunoprecipitation using anti- 
FLAG M2 Agarose (Sigma), Western blot analyses and dual luciferase assays were 
performed essentially as previously described by Zhou et al. (Proc. Natl. Acad. Sci. 
USA 97:1056-1061 (2000)). Briefly, the cells (American Type Culture Collection) 
were cultured in DME HG medium (GBCO/BRL) supplemented with 10% 
(vol/vol) FBS at 37 °C in a 5% C0 2 atmosphere. Transient tiansfection was 
performed by using Lipofectamine (GQ3CO/BRL) or Fugene 6 (Roche Molecular 
Biochemicals) according to the manufacturers' instructions. Cells were harvested 
24 to 48 hours after transfection and lysed in IP lysis buffer (50 mM Tris HCl, pH 
7.5/120 mM NaCl/5 mM EDTA/0.5% NP-40) at 5 x 10 7 cells per ml. 
Immunoprecipitation with anti-FLAG M2-agarose (Sigma, St. Louis, MO) was 
performed according to the manufacturer's instructions, hnmunoprecipitated 
proteins were released from the agarose beads by using FLAG-peptide and either 
used directly for HDAC enzymatic activity assays or resolved on SDS/PAGE for 
Western blot analyses. Anti-FLAG antibody was purchased from Sigma (St. Louis, 
MO). Western blot analyses were performed using standard methods. 

HDAC9 and HDAC9a enzymatic activity were assessed with Hie HDAC 
Fluorescent Activity Assay/Drug Discovery Kit-AK-500 (BIOMOL Research 
Laboratories) using a FLUOR DE LYS™ that contains an acetylated lysine side 
chain as a substrate and immunoprecipitated HDAC9-FLAG and HDAC9a-FLAG 
polypeptides according to the manufacturer's instruction and a SPECTRAmax® 
GEMINI XS microplate spectrofluorometer using the SOFTmax® PRO system 
(Molecular Devices) at excitation 355 nm and emission 460 nm with a cut off filter 
of 455 nm. Briefly, HDAC9-FLAG and HDAC9a-FLAG were incubated with the 
substrate overnight at room temperature in a 96-well plate. The reaction was 
stopped by addition of Fluor De Lys™ Developer and samples were read with the 
fluorometer. 
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As shown in FIG. 7, both HDAC9-FLAG and HD AC9a-FLAG deacetylated 
the acetylated lysine of FLUOR DE LYS™ and the activity of HDAC9 and 
HDAC9a was comparable. To examine the activity of HDAC9 and HDAC9a, 
inhibition studies using TSA were carried out by preincubating HDAC9-FLAG and 
5 HDAC9a-FLAG with TSA for 15 minutes at room temperature. The assay was then 
carried out as stated above. As shown in FIG. 7, TSA inhibited HDAC9 and 
HDAC9a deacetylase activity. The inset gel in FIG. 7 shows the amount of protein 
used in the assay. SAHA, a potent HDAC inhibitor (Richon et al, Proc. Natl. Acad. 
Sci. USA, 95:3003-3007 (1998)) also completely inhibited thehistone deacetylase 
10 activity of HDAC9-FLAG and HDAC9a-FLAG. The HDAC activity of HDAC9 
and HDAC9a was about ten times lower than the deacetylase activity of HDAC4 
when comparable amount of protein was used under conditions tested here. 

HDAC9 and HDAC9a enzymatic activity was also determined through 
HDAC enzymatic assays using 3 H-histones isolated from murine erythroleukemia 
1 5 cells as a substrate. This assay was performed essentially as described by Richon et 
al (Proc. Natl. Acad. Sci. USA, 95:3003-3007 (1998)). Briefly, HDAC9-FLAG 
and HDAC9a-FLAG were incubated with 3 H-histones overnight at 37°C. The 
reaction was stopped by the addition of 1M HCL/0.1 acetic acid. Released 3 H-acetic 
acid was extracted with ethyl acetate and quantified by scintillation counting. For 
20 inhibition studies, the immunoprecipitated complexes were preincubated with the 
different HDAC inhibitors for 30 minutes at 4*C. 

As shown in FIG. 8, HDAC9a-FLAG deacetylated 3 H-acetyl-histones. 
SAHA, a potent HDAC inhibitor also completely inhibited the histone deacetylase 
activity of HDAC9a-FLAG. TSA also inhibited HDAC9a deacetylase activity. 
25 Similar results were obtained when HDAC9 was used as the enzyme source. 

HDAC9 and HDAC9a repress MEF2-mediated transcription 

The Xenopus homolog of HDRP, MITR, was identified as a MEF2 
interacting transcriptional repressor (Sparrow et al, EMBO J. 18:5085-5098(1999)) 
30 and mouse HDRP also interacts with and represses MEF2 mediated transcription 
(Zhang et aL, J. Biol. Chem. 276:35-39 (2001)). We first tested whether HDAC9- 
FLAG and HDAC9a-FLAG interact with MEF2. 293 cells were transfected with 
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vector, HDAC9-FLAG, or HDAC9a-FLAG. The cells were subsequently lysed and 
HDAC9-FLAG and HDAC9a-FLAG proteins were immunoprecipitated with anti- 
FLAG antibodies. Western blot analysis of the immunoprecipitated proteins was 
carried out, using anti-MEF-2 antibody to probe the blot. As shown in FIG. 9A, 
5 both HDAC9 and HDAC9a interacted with MEF2 in 293T cells. 

It was then determined whether HDAC9 and HDAC9a repress MEF2- 
mediated transcription. This determination was carried out as follows. The 
p3XMEF2-luciferase reporter gene (100 ng) and the vector pRL-TK (Promega) (5 
ng) were co-transfected into 293T cells in the absence (pcDNA3 empty vector) or 
10 presence of MEF2C (100 ng of pCMV-MEF2Q. HDAC9-F (1 ng, 10 ng, or 100 ng 
of pFLAG-HDAC9; pFLAG-HDAC9 and HDAC9-FLAG are different constructs, 
with the FLAG sequence located at opposite ends of the HDAC9 nucleotide, but are 
functionally equivalent) or HDAC9a-F (1 ng, 10 ng, or 100 ng of pFLAG-HDAC9a; 
P FLAG-HDAC9a and HDAC9a-FLAG are different constructs, with the FLAG 
sequence located at opposite ends of the HDAC9a nucleotide, but are functionally 
equivalent) was included in a subset of experimental groups with the MEF2C 
vector. pFLAG empty vector was used to adjust the DNA to an equal amount in 
each transfection. The cells were harvested 24 to 36 hours after transfection and the 
luciferase activities were measured using the Dual-Luciferase™ Reporter Assay 
System from Promega according to the manufacturer's instruction. The firefly 
luciferase activity was first normalized to the co-transfected Renilla luciferase 
activity (encoded by the pRL-TK vector), and the luciferase activity value for cells 
transfected with MEF2C alone was set at 1. MEF2C activated transcription over 30 
times the basal level of transcription. As shown in FIG. 9B, HDAC9-FLAG and 
HDAC9a-FLAG repressed MEF2C mediated transcriptional activation in a dose- 
dependent manner and completely abolished the activation at the 100 ng dose for 
both HDAC9 and HDAC9a. The transcriptional repression effect of HDAC9 and 
HDAC9a on MEF2C mediated transcription was a specific effect since a co- 
transfected reporter gene for transfection efficiency containing a TK promoter was 
30 not repressed by HDAC9 or HDAC9a. 

Described herein is the identification and characterization of a new class H 
HDAC, designated HDAC9. HDAC9 has several alternatively spliced isoforms, 
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one of which is the previously identified HDRP (Zhou et aL 9 - Proc. Natl. Acad. Sci. 
USA 97:1056-1061 (2000)). HDAC9 and HDAC9a possess HDAC activity, which 
appears to have a lower specific enzymatic activity than HDAC4. While not 
wishing to be bound by any particular theory, it is possible that an essential co-factor 
5 is lost during immunoprecipitation or does not exist in 293T cells (for example, 
metastasis-associated protein 2 is essential for the assembly of a catalytically active 
HDAC1 (Zhang et al 9 Genes Dev. 13:1924-1935 (1999)), the substrates used are 
not its natural substrate, or the FLAG tag which interferes with the folding of the 
protein. 

10 Searching the human genome with the HDAC domain from either HDAC1 

or HDAC9 identified a total of 10 HDACs in the presently completed human 
genome sequence, a number of which are schematically represented in FIG. 10. 
HDACs 1, 2, 3, 8, 4, 5, 6, 7, 9, and 9a all have HDAC domains. HDRP, which is 
also schematically depicted in FIG. 10, does not have a catalytic domain. 

1 5 All references described herein are incorporated by reference in their 

entirety. While this invention has been particularly shown and described with 
reference to preferred embodiment thereof, it will be understood by those skilled in 
the art that various changes in form and details may be made therein without 
departing from the spirit and scope of the invention as defined by the appended 

20 claims. 
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CLAIMS 

What is claimed is: 

5 

1. An isolated or recombinant histone deacetylase polypeptide, said polypeptide 
selected from: 

a) an isolated or recombinant polypeptide comprising SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; 

10 and 

b) an isolated or recombinant polypeptide having at least 60% sequence 
identity with any one of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 
6, SEQ ID NO: 8, or SEQ ID NO: 10. 



The isolated or recombinant histone deacetylase polypeptide of Claim 1, said 
polypeptide selected from: 

a) a polypeptide consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10. 

The isolated or recombinant histone deacetylase polypeptide of Claim 1, 
wherein said polypeptide is human. 

An isolated nucleic acid molecule selected from the group: 

a) an isolated nucleic acid comprising SEQ ID NO: 1 , SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; 

b) a complement of an isolated nucleic acid comprising SEQ ID NO: 1, 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9 

c) an isolated nucleic acid encoding a histone deacetylase polypeptide 
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or 
SEQ ID NO: 10; 
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d) a complement of an isolated nucleic acid encoding a histone 
deacetylase polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; 

e) a nucleic acid that is hybridizeable under high stringency conditions 
5 to a nucleic acid molecule that encodes any of SEQ ID NO: 2, SEQ 

ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or a complement 
thereof; or 

f) a nucleic acid molecule that is hybridizeable under high stringency 
conditions to a nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 

10 3, SEQ ID NO: 5, or SEQ ID NO: 7; and 

g) an isolated nucleic acid molecule that has at least 55% sequence 
identity with any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 
5, SEQ ID NO: 7, SEQ ID NO: 9, or a complement thereof. 



15 5 . . The isolated nucleic acid molecule of Claim 4, said nucleic acid molecule 

consisting of the nucleic acid molecule selected from the group consisting of 
SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, and SEQ ID 
NO: 9. 

20 6. The isolated nucleic acid molecule of Claim 4, wherein said nucleic acid 
molecule is human. 



7. A vector comprising the isolated nucleic acid molecule of Claim 4. 
25 8. A cell comprising the vector of Claim 7 . 

9. A cell comprising the isolated nucleic acid molecule of Claim 4. 



1 0, A purified antibody that selectively binds a polypeptide of Claim 1 . 

30 

11. A method of identifying a compound that modulates expression of a nucleic 
acid molecule of Claim 4, said method comprising the steps of: 
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a) contacting said nucleic acid molecule with a candidate compound 
under conditions suitable for expression; and 

b) assessing the level of expression of said nucleic acid molecule, 
wherein a candidate compound that increases or decreases expression of said 

5 nucleic acid molecule relative to a control is a compound that modulates 

expression of said nucleic acid molecule. 

12. The method of Claim 1 1 , wherein said method is carried out in a cell or 
animal. 

10 

1 3 . The method of Claim 1 1 , wherein said method is carried out in a cell free 
system. 

1 4. A method of identifying a compound that modulates the enzymatic activity 
15 of the polypeptide of Claim 1 , said method comprising the steps of: 

a) contacting said polypeptide with a candidate compound under 
conditions suitable for enzymatic reaction; and 

b) assessing the enzymatic activity level of said polypeptide, 
wherein a candidate compound that increases or decreases the enzymatic 

20 activity level of said polypeptide relative to a control is a compound that 

modulates the enzymatic activity of said polypeptide. 

15. The method of Claim 14, wherein said method is carried out in a cell or 
animal. 

25 

1 6. The method of Claim 1 4, wherein said method is carried out in a cell free 
system. 



17. 

30 



The method of Claim 14, wherein said polypeptide is further contacted with 
a substrate for the polypeptide, and wherein said substrate is selected from 
the group consisting of a cell proliferation disease binding agent, an 
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apoptotic disease binding agent, and a cell differentiation disease binding 
agent. 

1 8 . The method of Claim 1 7, wherein said candidate compound is an inhibitor. 

5 

19. The method of Claim 17, wherein said candidate compound is an activator. 

20. A method of identifying a compound that modulates the transcriptional 
repression activity of the polypeptide of Claim 1, said method comprising 

10 the steps of: 

a) contacting said polypeptide with a candidate compound under 
conditions suitable for a transcriptional repression reaction; and 

b) assessing the transcriptional repression activity level of said 
polypeptide, 

15 wherein a candidate compound that increases or decreases the transcriptional 

repression activity level of said polypeptide relative to a control is a 
compound that modulates the transcriptional repression activity of said 
polypeptide. 



20 21. The method of Claim 20, wherein said method is earned out in a cell or 
animal. 



22. The method of Claim 20, wherein said method is carried out in a cell free 
system. 

25 

23. The method of Claim 20, wherein said polypeptide is further contacted with 
a substrate for the polypeptide, and wherein said substrate is selected from 
the group consisting of a cell proliferation disease binding agent, an 
apoptotic disease binding agent, and a cell differentiation disease binding 

30 agent. 



24. 



The method of Claim 23, wherein said candidate compound is an inhibitor. 
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25. The method of Claim 23, wherein said candidate compound is an activator. 

26. A method of identifying a compound that modulates expression of a nucleic 
acid molecule of Claim 4, said method comprising the steps of: 

5 a) providing a nucleic acid molecule comprising a promoter region of 

said nucleic acid of Claim 4 or part of a promoter region of said 
nucleic acid of Claim 4 operahly linked to a reporter gene; 
b) contacting said nucleic acid molecule or with a candidate compound; 
and 

10 c) assessing the level of said reporter gene, 

wherein a candidate compound that increases or decreases expression of said 
reporter gene relative to a control is a compound that modulates expression 
of said nucleic acid molecule of Claim 4. 

1 5 27. The method of Claim 26, wherein said method is carried out in a cell. 

28. A method of identifying a polypeptide that interacts with a polypeptide of 
Claim 1 in a yeast two-hybrid system, said method comprising the steps of: 

a) providing a first nucleic acid vector comprising a nucleic acid 

20 molecule encoding a DNA binding domain and said polypeptide of 

Claim 1; 

b) providing a second nucleic acid vector comprising a nucleic acid 
encoding a transcription activation domain and a nucleic acid 
encoding a test polypeptide; 

25 c) contacting said first nucleic acid vector with said second nucleic acid 

vector in a yeast two-hybrid system; and 
d) assessing transcriptional activation in said yeast two-hybrid system, 
wherein an increase in transcriptional activation relative to a control 
indicates that the test polypeptide is a polypeptide that interacts with said 

3 0 polypeptide of Claim 1 . 



29. 



A pharmaceutical composition comprising a polypeptide of Claim 1. 
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30. A method of diagnosing a cell proliferation disease, an apoptotic disease, or 
a cell differentiation disease in a subject, said method comprising the steps 
of: 

a) obtaining a sample from said subject; and 
5 b) assessing the level of activity or expression of said polypeptide of 

Claim 1 in said sample, or detecting the level of said nucleic acid 
molecule of Claim 4, 
wherein if said level is increased relative to a control, then said subject has 
an increased likelihood of having a cell proliferation disease, an apoptotic 
10 disease, or a cell differentiation disease, and wherein if said level is 

decreased relative to a control, then said subject has a decreased likelihood 
- of having a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. 

15 31. The method of Claim 30, wherein said level of activity or expression of said 
) polypeptide of Claim 1 in said sample is measured using 

immunohistochemical techniques. 

32. The method of Claim 30, wherein said level of said nucleic acid molecule of 
20 Claim 4 in said sample is measured using in situ hybridization techniques. 

33. A method of treating a cell proliferation disease, an apoptotic disease, or a 
cell differentiation disease, said method comprising administering a 
compound identified by the method of Claim 14. 

25 

34. A method of treating a cell proliferation disease, an apoptotic disease, or a 
cell differentiation disease, said method comprising administering a 
compound identified by the method of Claim 20. 



30 
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SEQUENCE LISTING 

<110> Sloan-Kettering Institute for Cancer Research 
Richon, Victoria 
Zhou, Xianbo 
Rifkind, Richard A. 
Marks, Paul A. 

<120> HDAC9 Polypeptides and Polynucleotides 
and Uses Thereof 

<130> 3254.1000005 

<150> 60/298,173 
<151> 2001-06-14 

<150> 60/311,686 
<151> 2001-08-10 

<150> 60/316,995 
<151> 2001-09-04 

<160> 22 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 3186 
<212> DNA 

<213> Homo sapiens 
<400> 1 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840 
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900 
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960 
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020 
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080 
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740 
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SE S5SSS StcS Ctgtgaaggt caaggaggaa 1800 

tttatgcaac ag?ct?£S Saacccaca ™ P aatctgggga gcaggctgct 1860 
ccgctggctg cggttggcat Sa»H» cacacacgtg cgctctctgt gcgccaagct 1920 
tcttcccctg c?gcc?ctgt cttacctcao f gaaacacc ^tctcgtctc caggactcac 1980 
tctgcaactg gaattgccta ^acccct?a a^f^ accgccccct ccagcctggc 2040 
tccaccaccc accctgagca ^gctggacgl a£ca££» "gtggcaat 2100 

actgggctgc taaataaatg tgagcgaaK 5°S? CaC ? act 9^agaa 2160 

cagcttgttc attctgaaca tcactcacta ttltf^T aagccagcct Sgaggaaata 2220 
aagctggacc ccaggatac? cctaggtga? ga£cSaa SXSS ggacgga «g 2280 
tgtggtggac ttggggtaaa caotm^ a agtttttttc ctcattacct 2340 

gcacgca?gg ctg^ggctg" tgtcatcjaa ctaS^^ 3gC * aCactc gtccggtgct 2400 
aagaatgggt ttgctgttg? gaggccccc? S " aagt ^tc aggagagctg 2460 
gggttctgct tt?ttaat?c aJSgcaS acea™ ? ^tgaagaatc cacagccatg 2520 
ataagcaaga tattgattgt agatcSqat attat^ t aCttgagaga ccaactaaat 2580 
ttttatgctg accccagcat cctgtacatt gaaacggtac ccagcaggcc 2640 

ttccctggca gtggagcccc aaaKaaaJt ™ Cat ° gctatgatga agggaacttt 2700 
aatattgcct ggacaggtgg ccttaaSJ ™* affBCC ttggagaag 9 gtacaatata 2760 
ttcaggacca ?IgtJaagcc tSSccaaa SZEF*?* atgttgagta ccttgaagca 2820 
gctggatttg atgcattgga aggccScc CagaCatggt cttagtatct 2880 

aaatgttttg gtcatttglc gtfgcaStg atoacatt™ gagggtacaa a Srtgacggca 2940 
gctctagaag gaggacatga tc^cacaqcc atSS?«K Ctgatggacg ^ggtgttg 3000 
gcccttctag gaaataaact atct gtgatg catcagaagc ctgtgtaaat 3060 

atgaatgctg tKttSS IcagaaSc llT^tt* ttCtccacca aagcccgaat 3120 
tcttaa acagaagatc attgaaattc aaagtatgtc tttaaagttc 3180 

3186 

<210> 2 

<211> 1011 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met His Ser Met lie Ser Ser Val Asp val Lys Ser Glu Val Pro Val 
Gly Leu Glu Pro He Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 
Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gin Leu Gin Gin 
Glu Leu Leu Leu He Gin Gin Gin Gin Gin He Gin Lys Gin Leu Leu 



lie Ala Glu Phe Gin Lys Gl'n His Glu Asn Leu Tnr Arg Gin His Gin 

Ala Gin Leu Gin Glu His He Lys Glu Leu Leu Ala He Lys Gin gL 

Gin Glu Leu Leu Glu Lys Glu Gin Lys Leu Glu Gin Gin Arg Gin Glu 

Gin Glu Val Glu Arg His Arg Arg Glu Gin Gin Leu Pro Pro Leu Arg 

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys 

135 140 
Glu Phe Leu Leu Ser Lys Ser Ala Thr 
150 155 

Lys Asn His Ser Val Ser Arg His Pro 
165 170 

His His Thr Ser Leu Asp Gin Ser Ser 
Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ma Gin Asp 
Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu 

215 990 
g. val Arg Ser Arg Leu Lys Gin Lys Val Ala Glu Arg Arg Ser Ser 

235 240 



Gin Lys Leu Gin Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr 
Asn 

^ Thr Ala Ala His His Thr Ser Leu Asp Gin Ser Ser Pro £t Leu 



"~ 150 -ice - ^ 

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp 
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J-ftr U. 


Pi v 
biy 


Leu 


Pro 


Ala 




330 










j j *j 




Asn 


Ser 




J-iy o 


b-L U 


uy o 


Pi n 


T ,vc 
j-iy o 


345 










J J u 






Gly Val 


Pro 


Leu 


Pro 


Pi \r 

biy 


Pi n 
bill 


lyr 










jOj 








Ser 


His 


XT A \J 




V^l 


x iix 




Pi ii 








JOU 










Gin 


Ala 


Leu 


Leu 


Pi n 


m o 


Iter U 


-Lit; U 






J _? D 












Lys 


Leu 


Leu 


val 


al ^ 

Ala. 


PI 

biy 


Pi \r 

biy 


Tra 1 
Val 




410 










/lie; 




Ala 


Thr 


Lys 


PI n 

blU 


Arg 


Tl 

lie 


Ser 


Pro 


425 
















Pro 


Arg 


TT A — 

rllS 


Arg 


Pro 


Leu 


Asn 


Arg 










A AR 








Ser 


Thr 


Leu 


Ala 


Pin 

bin 


Leu 


y a 1 
Val 


Tl 








A C(\ 
4tO U 










Glu 


Lys 


Pi Tl 

bin 


Lys 


bin 


Tyr 


PI r\ 

bin 


Pi r\ 

bin 






Al^ 










f±OU 


Ser 


Lys 


Ser 


Tl a 
11c 


Pill 
blU 


PI n 

bin 


Leu 


Lys 




490 














Ala 


Glu 


Pi n 
blU 


Pi n 
blU 


Leu 


PI n 
bill 


Pi \7 

biy 


Asp 


505 










JlU 






Pro 


Ser 


Ser 


Pi \r 

biy 


Asn 


Ser 


Tnr 


Arg 










z> & ~> 








Asp 


Thr 


Leu 


Pi V 

biy 


V3-L11 


val 


Pi XT 

bJ-y 


Al a 

Al Ct 








RAH 










Asp 


Ser 


Asp 


Pi ii 

blU. 


ASp 


Al a 


bill 


Tl P» 

-L IC 






jjj 










J D VJ 


Ala 


Ala 


file 




a~\ n 

bill 


Pi n 

Ulll 


XT J. 






570 














Leu 


Ser 


Val 


Arg 


Gin 


Ala 


Pro 


Leu 


585 
















Glu Lys 


His 


Ay* ft 
y 


Leu 


Val 


Ser 












605 








Val 


Leu 


Pro 


His 


Pro 


Ala 


Met 


Asp 








O £* v 










Thr Gly 


He 


Ala 


Tvr 
xyx 


Asp 


Pro 


Leu 






635 










640 


Gly Asn 


Ser 


Thr 


Thr 


His 


Pro 


Glu 




650 










655 




Trp 


Ser 


Arg 


Leu 


Gin 


Glu 


Thr 


Gly 


665 










670 






Gin Gly 


Arg 


Lys 


Ala 


Ser 


Leu 


Glu 










685 








His 


His 


Ser 


Leu 


Leu 


Tyr 


Gly 


Thr 








700 










Asp 


Pro 


Arg 


He 


Leu 


Leu 


Gly 


Asp 




715 










720 


Leu 


Pro 


Cys 


Gly 


Gly 


Leu 


Gly 


Val 




730 










735 
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Asp Ser Asp Thr He Trp Asn Glu Leu His Ser Ser Gly Ala Ala Arg 

*r u 745 750 

Met Ala Val Gly Cys Val He Glu Leu Ala Ser Lys Val Ala Ser Gly 

760 765 
Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro Gly His His Ala 

Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn Ser Val Ala lie 

790 795 fin 

Thr Ala Lys Tyr Leu Arg Asp Gin Leu Asn lie Ser Lys lie Leu lie 
805 8io 



Val Asp Leu Asp Val His His Gly Asn Gly Thr Gin Gin Ala Pne Tyr 

Ala Asp Pro Ser He Leu Tyr lie Ser Leu His Arg Tyr Asp Glu Gly 

840 045 
Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val Gly Thr Gly Leu 

855 860 
865 ^ 55 ASn 116 Ala Tr P Thr G ^ Leu Asp Pro 



Pro Met Gly Asp Val Glu Tyr Leu Glu Ala Phe Arg Thr He Val Lys" 
istib 890 



870 ~ 875 880 

Lys 

Pro Val Ala Lys Glu Phe Asp Pro Asp Met Val Leu Val Ser III Gly 

Phe Asp Ala Leu Glu Gly His Thr Pro Pro Leu Gly Gly Tyr Lys Val 

27X3 920 925 

Thr Ala Lys Cys Phe Gly His Leu Thr Lys Gin Leu Met Thr Leu Ala 

935 940 
Asp Gly Arg Val Val Leu Ala Leu Glu Gly Gly His Asp Leu Thr Ala 

lie Cys Asp Ala Ser Glu Ala Cys Val Asn Ala Leu Leu Gly Asn G^u 

Leu Glu Pro Leu Ala Glu Asp He Leu hIs Gin Ser Pro Asn Met Asn 

_ yau 985 oon 

Ala Val lie Ser Leu Gin Lys He He Glu lie Gin Ser Met Ser Leu 

Lys Phe Ser 10 °° 1005 

1010 



<210> 3 

<211> 3499 

<212> DNA 

<213> Homo sapiens 

<400> 3 

ggggaagaga ggcacagaca cagataggag 
tgagggtttt tgcaacaaaa ccctagcagc 
ggacgagagc agctcttggc tcagcaaaga 
aagtcagaag ttcctgtggg cctggagccc 
aggatgatga tgcccgtggt ggaccctgtt 
cttcttatcc agcagcagca acaaatccag 
cagcatgaga acttgacacg gcagcaccag 
ctagccataa aacagcaaca agaactccta 
caagaacagg aagtagagag gcatcgcaga 
gatagaggac gagaaagggc agtggcaagt 
ctactgagta aatcagcaac qaaagacact 
cgccatccca agctctggta cacggctgcc 
ccccttagtg gaacatctcc atcctacaag 
gatgatttcc cccttcgaaa aactgcctct 
aaacagaaag tggcagagag gagaagcagc 
gtcacttcat tcaagaagcg aatgtttgag 
ccaggctctg gtcccagttc accaaacaat 
acttcggttt tgccccctac ccctcatgcc 



aagggcaccg gctggagcca cttgcaggac 60 
ctgaagaact ctaagccaga tggggtggct 120 
atgcacagta tgatcagctc agtggatgtg 180 
atctcacctt tagacctaag gacagacctc 240 
gtccgtgaga agcaattgca gcaggaatta 300 
aagcagcttc tgatagcaga gtttcagaaa 360 
gctcagcttc aggagcatat caaggaactt 420 
gaaaaggagc agaaactgga gcagcagagg 480 
gaacagcagc ttcctcctct cagaggcaaa 540 
acagaagtaa agcagaagct tcaagagttc 600 
ccaactaatg gaaaaaatca ttccgtgagc 660 
caccacacat cattggatca aagctctcca 720 
tacacattac caggagcaca agatgcaaag 780 
gagcccaact tgaaggtgcg gtccaggtta 840 
cccttactca ggcggaagga tggaaatgtt 900 
gtgacagaat cctcagtcag tagcagttct 960 
gggccaactg gaagtgttac tgaaaatgag 1020 
gagcaaatgg tttcacagca acgcattcta 1080 
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attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740 
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800 
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860 
tttatgcaac agcctttcct ggaacccacg cacacacgtg cgctctctgt gcgccaagct 1920 
ccgctggctg cggttggcat ggatggatta gagaaacacc gtctcgtctc caggactcac 1980 
tcttcccctg ctgcctctgt tttacctcac ccagcaatgg accgccccct ccagcctggc 2040 
tctgcaactg gaattgccta tgaccccttg atgctgaaac accagtgcgt ttgtggcaat 2100 
tccaccaccc accctgagca tgctggacga atacagagta tctggtcacg actgcaagaa 2160 
acbgggctgc taaataaatg tgagcgaatt caaggtcgaa aagccagcct ggaggaaata 2220 
cagcttgttc attctgaaca tcactcactg ttgtatggca ccaaccccct ggacggacag 2280 
aagctggacc ccaggatact cctaggtgat gactctcaaa agtttttttc ctcattacct 2340 
tgtggtggac ttggggtgga cagtgacacc atttggaatg agctacactc gtccggtgct 2400 
gcacgcatgg ctgttggctg tgtcatcgag ctggcttcca aagtggcctc aggagagctg 2460 
aagaatgggt ttgctgttgt gaggccccct ggccatcacg ctgaagaatc cacagccatg 2520 
gggttctgct tttttaattc agttgcaatt accgccaaat acttgagaga ccaactaaat 2580 
ataagcaaga tattgattgt agatctggat gttcaccatg gaaacggtac ccagcaggcc 2640 
ttttatgctg accccagcat cctgtacatt tcactccatc gctatgatga agggaacttt 2700 
ttccctggca gtggagcccc aaatgaggtt cggtttattt ctttagagcc ccacttttat 2760 
ttgtatcttt caggtaattg cattgcatga ttacccctaa ttttcttgtc ctttgctggt 2820 
gttttaaatt acacgagatt actgaattgt cccatgggac caagaaccag tgcagaacaa 2880 
gtgcataacc cagagcactg tttgtcaggg aaggttgggc tgatttgatg tgttgtttga 2940 
tgtttatttc aagagctccc atgtgcttgt tttcctctct tcttgctttc ttccatttgc 3000 
tctcttctct gcccaccgtg gtgtgtcttt ctcttcccag gttggaacag gccttggaga 3060 
agggtacaat ataaatattg cctggacagg tggccttgat cctcccatgg gagatgttga 3120 
gtaccttgaa gcattcagga ccatcgtgaa gcctgtggcc aaagagtttg atccagacat 3180 
ggtcttagta tctgctggat ttgatgcatt ggaaggccac acccctcctc taggagggta 3240 
caaagtgacg gcaaaatgtt ttggtcattt gacgaagcaa ttgatgacat tggctgatgg 3300 
acgtgtggtg ttggctctag aaggaggaca tgatctcaca gccatctgtg atgcatcaga 3360 
agcctgtgta aatgcccttc taggaaatga gctggagcca cttgcagaag atattctcca 3420 
ccaaagcccg aatatgaatg ctgttatttc tttacagaag atcattgaaa ttcaaagtat 3480 
gtctttaaag ttctcttaa 3499 

<210> 4 
<211> 879 
<212> PRT 

<213> Homo sapiens 



<400> 4 






























Met 


His 


Ser 


Met 


He 


Ser 


Ser 


Val 


Asp 


Val 


Lys 


Ser 


Glu 


Val 


Pro 


Val 


1 








5 










10 










15 




Gly 


Leu 


Glu 


Pro 
20 


He 


Ser 


Pro 


Leu 


Asp 
25 


Leu 


Arg 


Thr 


Asp 


Leu 
30 


Arg 


Met 


Met 


Met 


Pro 
35 


Val 


Val 


Asp 


Pro 


Val 
40 


Val 


Arg 


Glu 


Lys 


Gin 
45 


Leu 


Gin 


Gin 


Glu 


Leu 
50 


Leu 


Leu 


He 


Gin 


Gin 
55 


Gin 


Gin 


Gin 


He 


Gin 
60 


Lys 


Gin 


Leu 


Leu 


lie 


Ala 


Glu 


Phe 


Gin 


Lys 


Gin 


His 


Glu 


Asn 


Leu 


Thr 


Arg 


Gin 


His 


Gin 


65 










70 










75 










80 


Ala 


Gin 


Leu 


Gin 


Glu 
85 


His 


He 


Lys 


Glu 


Leu 
90 


Leu 


Ala 


He 


Lys 


Gin 
95 


Gin 


Gin 


Glu 


Leu 


Leu 
100 


Glu 


Lys 


Glu 


Gin 


Lys 
105 


Leu 


Glu 


Gin 


Gin 


Arg 
110 


Gin 


Glu 
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Gin Glu Val 
115 

Gly Lys Asp 

130 
Gin Lys Leu 
145 

Pro Thr Asn 

Tyr Thr Ala 

Ser Gly Thr 
195 

Ala Lys Asp 
210 

Lys Val Arg 
225 

Pro Leu Leu 

Arg Met Phe 

Ser Gly Pro 
275 

Asn Glu Thr 

290 
Ser Gin Gin 
305 

Leu Tyr Thr 

Val Pro Ser 

Cys Glu Thr 
355 

Gly Gly Ser 

370 
Gly Lys Pro 
385 

Leu Lys Glu 



Glu Arg His 

Arg Gly Arg 

Gin Glu Phe 
150 

Gly Lys Asn 

165 
Ala His His 
180 

Ser Pro Ser 

Asp Phe Pro 

Ser Arg Leu 
230 

Arg Arg Lys 

245 
Glu Val Thr 
260 

Ser Ser Pro 

Ser Val Leu 

Arg He Leu 
. 310 
Ser Pro Ser 

325 
Gin Leu Asn 
340 

Gin Thr Leu 
He Pro Ala 



Arg Arg 
120 
Glu Arg 
135 

Leu Leu 

His Ser 

Thr Ser 

Tyr Lys 
200 
Leu Arg 
215 

Lys Gin 

Asp Gly 

Glu Ser 

Asn Asn 
280 
Pro Pro 
295 

He His 



Glu Gin Gin 

Ala Val Ala 

Ser Lys Ser 
155 

Val Ser Arg 

170 
Leu Asp Gin 
185 

Tyr Thr Leu 

Lys Thr Ala 

Lys Val Ala 
235 

Asn Val Val 

250 
Ser Val Ser 
265 

Gly Pro Thr 
Thr Pro His 



Pro Leu His 

Gly He Arg 
435 

Thr Gin Ser 

450 
Gin Gin Gin 
465 

Gin He His 



Pro Asn Ser 
390 

Gin Met Arg 

405 
Pro Gin Ser 
420 

Gly Thr His 
Ala Pro Leu 



Leu Pro 

Ala Ser 

Arg Gin 
360 
Ser Ser 
375 

Ser His 



His Gin Gin 
470 

Met Asn Lys 
485 

Gin Pro Gly Ser His Leu 
500 

Gin Glu Asp 



Gin Ala Met 
515 

Ser Asp Ser 

530 
Val Lys Val 
545 

Gin Glu Met 

Leu Glu Pro 

Ala Ala Val 
595 



Ser Ala Cys 

Lys Glu Glu 
550 

Glu Ser Gly 

565 
Thr His Thr 
580 

Gly Met Asp 



Gin Gin 

Pro Leu 

Lys Leu 
440 
Pro Gin 
455 

Phe Leu 

Leu Leu 

Glu Glu 

Arg Ala 
520 
Val Asp 
535 

Pro Val 

Glu Gin 

Arg Ala 

Gly Leu 
600 



Glu Asp Ser 
315 

Asn He Thr 

330 
Asn Ser Leu 
345 

Gly Val Pro 

Ser His Pro 

Gin Ala Leu 
395 

Lys Leu Leu 

410 
Ala Thr 'Lys 
425 

Pro Arg His 

Ser Thr Leu 

Glu Lys Gin 
475 

Ser Lys Ser 

490 
Ala Glu Glu 
505 

Pro Ser Ser 



Leu Pro 
125 
Ser Thr 
140 

Ala Thr 

His Pro 

Ser Ser 

Pro Gly 
205 
Ser Glu 
220 

Glu Arg 

Thr Ser 

Ser Ser 

Gly Ser 
285 
Ala Glu 
300 

Met Asn 
Leu Gly 
Lys Glu 



Leu Pro 
365 
His Val 
380 

Leu Gin 

Val Ala 

Glu Arg 

Arg Pro 
445 
Ala Gin 
460 

Lys Gin 
He Glu 
Glu Leu 



Pro Leu Arg 

Glu Val Lys 

Lys Asp Thr 
160 

Lys Leu Trp 
175 

Pro Pro Leu 
190 

Ala Gin Asp 

Pro Asn Leu 

Arg Ser Ser 
240 

Phe Lys Lys 
255 

Ser Pro Gly 
270 

Val Thr Glu 

Gin Met Val 

Leu Leu Ser 
320 

Leu Pro Ala 
335 

Lys Gin Lys 
350 

Gly Gin Tyr 

Thr Leu Glu 

His Leu Leu 
400 

Gly Gly Val 

415 
He Ser Pro 
430 

Leu Asn Arg 

Leu Val He 

Tyr Gin Gin 
480 

Gin Leu Lys 
495 

Gin Gly Asp 
510 

Ser Thr Arg 



Asp Thr Leu 

Asp Ser Asp 
555 

Ala Ala Phe 

570 
Leu Ser Val 
585 

Glu Lys His 



Gly Asn 
525 

Gly Gin Val Gly Ala 
540 

Glu Asp 



Met Gin 

Arg Gin 

Arg Leu 
605 



Ala Gin He 
560 

Gin Pro Phe 

575 
Ala Pro Leu 
590 

Val Ser Arg 
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Thr 


His 


Ser 


Ser 


Pro Ala 


Ala Ser Val Leu 


Pro 


His 


Pro 


Ala 


Met 


Asp 




610 








615 




620 










Arg 


Pro 


Leu 


Gin 


Pro Gly 


Ser Ala Thr Gly 


He 


Ala 


Tyr 


Asp 


Pro 


Leu 


625 








630 




635 










640 


Met 


Leu 


Lys 


His 


Gin Cys 


Val Cys Gly Asn 


Ser 


Thr 


Thr 


His 


Pro 


Glu 










645 


650 










655 




His 


Ala 


Gly 


Arg 


He Gin 


Ser He Trp Ser 


Arg 


Leu 


Gin 


Glu 


Thr 


Gly 








660 




665 








670 






Leu 


Leu 


Asn 


Lys 


Cys Glu 


Arg He Gin Gly 


Arg 


Lys 


Ala 


Ser 


Leu 


Glu 






675 






680 






685 








Glu 


He 


Gin 


Leu 


Val His 


Ser Glu His His 


Ser 


Leu 


Leu 


Tyr 


Gly 


Thr 




690 








695 




700 










Asn 


Pro 


Leu 


Asp Gly Gin 


Lys Leu Asp. Pro 


Arg 


He 


Leu 


Leu 


Gly 


Asp 


705 








710 




715 










720 


Asp 


Ser 


Gin 


Lys 


Phe Phe 


Ser Ser Leu Pro 


Cys 


Gly 


Gly 


Leu 


Gly 


Val 










725 


730 










735 




Asp 


Ser 


Asp 


Thr 


He Trp 


Asn Glu Leu His 


Ser 


Ser 


Gly 


Ala 


Ala 


Arg 








740 




745 








750 






Met 


Ala 


Val 


Gly Cys Val 


He Glu Leu Ala 


Ser 


Lys 


Val 


Ala 


Ser 


Gly 






755 






760 






765 








Glu 


Leu 


Lys 


Asn Gly Phe 


Ala Val Val Arg 


Pro 


Pro 


Gly 


His 


His 


Ala 




770 








775 




780 










Glu 


Glu 


Ser 


Thr 


Ala Met 


Gly Phe Cys Phe 


Phe 


Asn 


Ser 


Val 


Ala 


He 


785 








790 




795 










800 


Thr 


Ala 


Lys 


Tyr 


Leu Arg 


Asp Gin Leu Asn 


He 


Ser 


Lys 


He 


Leu 


He 










805 


810 










815 




Val 


Asp 


Leu 


Asp 


Val His 


His Gly Asn Gly 


Thr 


Gin 


Gin 


Ala 


Phe 


Tyr 








820 




825 








830 






Ala 


Asp 


Pro 


Ser 


lie Leu 


Tyr He Ser Leu 


His 


Arg 


Tyr 


Asp 


Glu 


Gly 






835 






840 






845 








Asn 


Phe 


Phe 


Pro 


Gly Ser 


Gly Ala Pro Asn 


Glu 


Val 


Arg 


Phe 


He 


Ser 




850 








855 




860 










Leu 


Glu 


Pro 


His 


Phe Tyr 


Leu Tyr Leu Ser 


Gly 


Asn 


Cys 


He 


Ala 




865 








870 




875 













<210> 5 

<211> 3054 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ggggaagaga ggcacagaca cagataggag 
tgagggtttt tgcaacaaaa ccctagcagc 
ggacgagagc agctcttggc tcagcaaaga 
aagtcagaag .ttcctgtggg cctggagccc 
aggatgatga tgcccgtggt ggaccctgtt 
cttcttatcc agcagcagca acaaatccag 
cagcatgaga acttgacacg gcagcaccag 
ctagccataa aacagcaaca agaactccta 
caagaacagg aagtagagag gcatcgcaga 
gatagaggac gagaaagggc agtggcaagt 
ctactgagta aatcagcaac gaaagacact 
cgccatccca agctctggta cacggctgcc 
ccccttagtg gaacatctcc atcctacaag 
gatgatttcc cccttcgaaa aactgaatcc 
cccagttcac caaacaatgg gccaactgga 
ccccctaccc ctcatgccga gcaaatggtt 
tccatgaacc tgctaagtct ttatacctct 
cccgcagtgc catcccagct caatgcttcg 
acgcagacgc ttaggcaagg tgttcctctg 
tcttccagcc accctcatgt tactttagag 



aagggcaccg gctggagcca cttgcaggac 60 
ctgaagaact ctaagccaga tggggtggct 120 
atgcacagta tgatcagctc agtggatgtg 180 
atctcacctt tagacctaag gacagacctc 240 
gtccgtgaga agcaattgca gcaggaatta 300 
aagcagcttc tgatagcaga gtttcagaaa 360 
gctcagcttc aggagcatat caaggaactt 420 
gaaaaggagc agaaactgga gcagcagagg 480 
gaacagcagc ttcctcctct cagaggcaaa 540 
acagaagtaa agcagaagct tcaagagttc 600 
ccaactaatg gaaaaaatca ttccgtgagc 660 
caccacacat cattggatca aagctctcca 720 
tacacattac caggagcaca agatgcaaag 780 
tcagtcagta gcagttctcc aggctctggt 840 
agtgttactg aaaatgagac ttcggttttg 900 
tcacagcaac gcattctaat tcatgaagat 960 
ccttctttgc ccaacattac cttggggctt 1020 
aattcactca aagaaaagca gaagtgtgag 1080 
cctgggcagt atggaggcag catcccggca 1140 
ggaaagccac ccaacagcag ccaccaggct 1200 
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ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
?? ; g :? 93 sgccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttgg aacaggcctt ggagaagggt acaatataaa tattgcctgg 2640 
acaggtggcc ttgatcctcc catgggagat gttgagtacc ttgaagcatt caggaccatc 2700 
gtgaagcctg tggccaaaga gtttgatcca gacatggtct tagtatctgc tggatttgat 2760 
gcattggaag gccacacccc tcctctagga gggtacaaag tgacggcaaa atgttttggt 2820 
catttgacga agcaattgat gacattggct gatggacgtg tggtgttggc tctagaagga 2880 
ggacatgatc tcacagccat ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga 2940 
^?^ a ?^ gg a ^ ccactt Sc agaagatatt ctccaccaaa gcccgaatat gaatgctgtt 3000 
atttctttac agaagatcat tgaaattcaa agtatgtctt taaagttctc ttaa 3054 

<210> 6 
<211> 967 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met His Ser Met He Ser Ser Val Asp Val Lys Ser Glu Val Pro Val 

5 10 15 

Gly Leu Glu Pro He Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 

20 25 30 

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gin Leu Gin Gin 

35 40 45 

Glu Leu Leu Leu He Gin Gin Gin Gin Gin He Gin Lys Gin Leu Leu 

_ 50 55 60 

lie Ala Glu Phe Gin Lys Gin His Glu Asn Leu Thr Arg Gin His Gin 
65 70 75 80 

Ala Gin Leu Gin Glu His He Lys Glu Leu Leu Ala He Lys Gin Gin 

85 90 55 

Gin Glu Leu Leu Glu Lys Glu Gin Lys Leu Glu Gin Gin Arg Gin Glu 

100 105 no 

Gin Glu Val Glu Arg His Arg Arg Glu Gin Gin Leu Pro Pro Leu Ara 

115 120 125 

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lvs 

130 135 140 

Gin Lys Leu Gin Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr 
i 45 fc 150 155 160 

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp 

165 170 175 

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gin Ser Ser Pro Pro Leu 
180 185 190 
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Ser 


Gly 


Thr 


Ser 


Pro 


Ser 


Tyr Lys 


Tyr 


Thr 


Leu 


Pro 


Gly 


Ala 


Gin 


Asp 






195 








200 










205 








Ala 


Lys 


Asp 


Asp 


Phe 


Pro 


Leu Arg 


Lys 


Thr 


Glu 


Ser 


Ser 


Val 


Ser 


Ser 




210 










215 








220 










Ser 


Ser 


Pro 


Gly 


Ser Gly 


Pro Ser 


Ser 


Pro 


Asn 


Asn 


Gly 


Pro 


Thr 


Gly 


225 








230 








235 










240 


Ser 


Val 


Thr 


Glu 


Asn 


Glu 


Thr Ser 


Val 


Leu 


Pro 


Pro 


Thr 


Pro 


His 


Ala 










245 








250 










255 




Glu 


Gin 


Met 


Val 


Ser 


Gin 


Gin Arg 


He 


Leu 


He 


His 


Glu 


Asp 


Ser 


Met 








260 








265 










270 






Asn 


Leu 


Leu 


Ser 


Leu 


Tyr 


Thr Ser 


Pro 


Ser 


Leu 


Pro 


Asn 


He 


Thr 


Leu 






275 








280 










285 








Gly 


Leu 


Pro 


Ala 


Val 


Pro 


Ser Gin 


Leu 


Asn 


Ala 


Ser 


Asn 


Ser 


Leu 


Lys 




290 










295 








300 










Glu 


Lys 


Gin 


Lys 


Cys Glu 


Thr Gin 


Thr 


Leu 


Arg 


Gin 


Gly 


Val 


Pro 


Leu 


305 










310 








315 










320 


Pro 


Gly 


Gin 


Tyr Gly Gly 


Ser He 


Pro 


Ala 


Ser 


Ser 


Ser 


His 


Pro 


His 










325 








330 










335 




Val 


Thr 


Leu 


Glu Gly Lys 


Pro Pro 


Asn 


Ser 


Ser 


His 


Gin 


Ala 


Leu 


Leu 








340 








345 










350 






Gin 


His 


Leu 


Leu Leu Lys 


Glu Gin 


Met 


Arg 


Gin 


Gin 


Lys 


Leu 


Leu 


Val 






355 








360 










365 








Ala 


Gly 


Gly 


Val 


Pro 


Leu 


His Pro 


Gin 


Ser 


Pro 


Leu 


Ala 


Thr 


Lys 


Glu 




370 










375 








380 










Arg 


He 


Ser 


Pro Gly 


He 


Arg Gly Thr His 


Lys 


Leu 


Pro 


Arg 


His 


Arg 


385 










390 








395 










400 


Pro 


Leu 


Asn 


Arg 


Thr 


Gin 


Ser Ala 


Pro 


Leu 


Pro 


Gin 


Ser 


Thr 


Leu 


Ala 










405 








410 










415 




Gin 


Leu 


Val 


He 


Gin 


Gin 


Gin His 


Gin Gin 


Phe 


Leu 


Glu 


Lys 


Gin 


Lys 








420 








425 










430 






Gin 


Tyr 


Gin 


Gin 


Gin 


He 


His Met 


Asn 


Lys 


Leu 


Leu 


Ser 


Lys 


Ser 


He 






435 








440 










445 








Glu 


Gin 


Leu 


Lys 


Gin 


Pro 


Gly Ser His 


Leu 


Glu 


Glu 


Ala 


Glu 


Glu 


Glu 




450 










455 








460 










I>eu 


Gin 


Gly 


Asp 


Gin 


Ala 


Met Gin 


Glu 


Asp 


Arg 


Ala 


Pro 


Ser 


Ser 


Gly 


465 










470 








475 










480 


Asn 


Ser 


Thr 


Arg 


Ser 


Asp 


Ser Ser 


Ala 


Cys 


Val 


Asp 


Asp 


Thr 


Leu 


Gly 










485 








490 










495 




Gin 


Val 


Gly 


Ala 


Val 


Lys 


Val Lys 


Glu 


Glu 


Pro 


Val 


Asp 


Ser 


Asp 


Glu 








500 








505 










510 






Asp 


Ala 


Gin 


He 


Gin 


Glu 


Met Glu 


Ser 


Gly 


Glu 


Gin 


Ala 


Ala 


Phe 


Met 






515 








520 










525 








Gin 


Gin 


Pro 


Phe 


Leu 


Glu 


Pro Thr 


His 


Thr 


Arg 


Ala 


Leu 


Ser 


Val 


Arg 




530 










535 








540 










Gin 


Ala 


Pro 


Leu 


Ala 


Ala 


Val Gly Met Asp 


Gly 


Leu 


Glu 


Lys 


His 


Arg 


545 










550 








555 










560. 


Leu 


Val 


Ser 


Arg 


Thr 


His 


Ser Ser 


Pro 


Ala 


Ala 


Ser 


Val 


Leu 


Pro 


His 










565 








570 










575 




Pro 


Ala 


Met 


Asp 


Arg 


Pro 


Leu Gin Pro Gly 


Ser 


Ala 


Thr 


Gly 


He 


Ala 








580 








585 










590 






Tyr 


Asp 


Pro 


Leu 


Met 


Leu 


Lys His 


Gin 


Cys 


Val 


Cys 


Gly 


Asn 


Ser 


Thr 






595 








600 










605 








Thr 


His 


Pro 


Glu 


His 


Ala 


Gly Arg He Gin 


Ser 


He 


Trp 


Ser 


Arg 


Leu 




610 










615 








620 










Gin 


Glu 


Thr 


Gly Leu Leu 


Asn Lys Cys 


Glu 


Arg 


He 


Gin 


Gly 


Arg 


Lys 


625 










630 








635 










640 


Ala 


Ser 


Leu 


Glu 


Glu 


He 


Gin Leu Val 


His 


Ser 


Glu 


His 


His 


Ser 


Leu 










645 








650 










655 




Leu 


Tyr 


Gly 


Thr 


Asn 


Pro 


Leu Asp Gly Gin 


Lys 


Leu 


Asp 


Pro 


Arg 


He 








660 








665 










670 






Leu 


Leu 


Gly 


Asp Asp 


Ser 


Gin Lys 


Phe 


Phe 


Ser 


Ser 


Leu 


Pro 


Cys 


Gly 






675 








680 










685 
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Gly Leu 
690 
Gly Ala 
705 

Val Ala 

Gly His 

Ser Val 

Lys lie 
770 
Gin Ala 
785 

Tyr Asp 

Gly Thr 

Gly Leu 

Thr He 
850 
Val Ser 
865 

Gly Tyr 

Met Thr 

Asp Leu 

Leu Gly 
930 
Pro Asn 
945 

Ser Met 



Gly Val Asp Ser 
Ala Arg 
Ser Gly 



His Ala 
740 
Ala He 
755 

Leu He 

Phe Tyr 

Glu Gly 

Gly Leu 
820 
Asp Pro 
835 

Val Lys 

Ala Gly 

Lys Val 

Leu Ala 
900 
Thr Ala 
915 

Asn Glu 
Met Asn 
Ser Leu 



Met Ala 
710 
Glu Leu 
725 

Glu Glu 



Asp Thr He 
695 

Val Gly Cys 



Trp Asn 



Val He 
715 

Lys Asn Gly Phe Ala 
730 

Met Gly 



Thr Ala 

Val Asp 

Ala Asp 
790 
Asn Phe 
805 

Gly Glu 

Pro Met 

Pro Val 

Phe Asp 
870 
Thr Ala 
885 

Asp Gly 

He Cys 

Leu Glu 

Ala Val 
950 
Lys Phe 
965 



Ser Thr Ala 
745 

Lys Tyr Leu 

760 
Leu Asp Val 
775 

Pro Ser He 

Phe Pro Gly 

Gly Tyr Asn 
825 

Gly Asp Val 

840 
Ala Lys Glu 
855 

Ala Leu Glu 

Lys Cys Phe 

Arg Val Val 
905 

Asp Ala Ser 

920 
Pro Leu Ala 
935 

He Ser Leu 
Ser 



Arg Asp 

His His 

Leu Tyr 
795 
Ser Gly 
810 

He Asn 

Glu Tyr 

Phe Asp 

Gly His 
875 
Gly His 
890 

Leu Ala 

Glu Ala 

Glu Asp 

Gin Lys 
955 



Glu Leu 
700 

Glu Leu 

Val Val 

Phe Cys 

Gin Leu 
765 
Gly Asn 
780 

He Ser 

Ala Pro 

He Ala 

Leu Glu 
845 
Pro Asp 
860 

Thr Pro 

Leu Thr 

Leu Glu 

Cys Val 
925 
He Leu 
940 

He He 



His Ser Ser 

Ala Ser Lys 
720 

Arg Pro Pro 

735 
Phe Phe Asn 
750 

Asn He Ser 

Gly Thr Gin 

Leu His Arg 
800 

Asn Glu Val 

815 
Trp Thr Gly 
830 

Ala Phe Arg 

Met Val Leu 

Pro Leu Gly 
880 

Lys Gin Leu 

895 
Gly Gly His 
910 

Asn Ala Leu 

His Gin Ser 

Glu lie Gin 
960 



<210> 7 

<211> 3367 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ggggaagaga ggcacagaca cagataggag 
tgagggtttt tgcaacaaaa ccctagcagc 
ggacgagagc agctcttggc tcagcaaaga 
aagtcagaag ttcctgtggg cctggagccc 
aggatgatga tgcccgtggt ggaccctgtt 
cttcttatcc agcagcagca acaaatccag 
cagcatgaga acttgacacg gcagcaccag 
ctagccataa aacagcaaca agaactccta 
caagaacagg aagtagagag gcatcgcaga 
gatagaggac gagaaagggc agtggcaagt 
ctactgagta aatcagcaac gaaagacact 
cgccatccca agctctggta cacggctgcc 
ccccttagtg gaacatctcc atcctacaag 
gatgatttcc cccttcgaaa aactgaatcc 
cccagttcac caaacaatgg gccaactgga 
ccccctaccc ctcatgccga gcaaatggtt 
tccatgaacc tgctaagtct ttatacctct 
cccgcagtgc catcccagct caatgcttcg 



aagggcaccg gctggagcca cttgcaggac 60 
ctgaagaact ctaagccaga tggggtggct 120 
atgcacagta tgatcagctc agtggatgtg 180 
atctcacctt tagacctaag gacagacctc 240 
gtccgtgaga agcaattgca gcaggaatta 300 
aagcagcttc tgatagcaga gtttcagaaa 360 
gctcagcttc aggagcatat caaggaactt 420 
gaaaaggagc agaaactgga gcagcagagg 480 
gaacagcagc ttcctcctct cagaggcaaa 540 
acagaagtaa agcagaagct tcaagagttc 600 
ccaactaatg gaaaaaatca ttccgtgagc 660 
caccacacat cattggatca aagctctcca 720 
tacacattac caggagcaca agatgcaaag 780 
tcagtcagta gcagttctcc aggctctggt 840 
agtgttactg aaaatgagac ttcggttttg 900 
tcacagcaac gcattctaat tcatgaagat 960 
ccttctttgc ccaacattac cttggggctt 1020 
aattcactca aagaaaagca gaagtgtgag 1080 
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acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttcg gtttatttct ttagagcccc acttttattt gtatctttca 2640 
ggtaattgca ttgcatgatt acccctaatt ttcttgtcct ttgctggtgt tttaaattac 2700 
acgagattac tgaattgtcc catgggacca agaaccagtg cagaacaagt gcataaccca 2760 
gagcactgtt tgtcagggaa ggttgggctg atttgatgtg ttgtttgatg tttatttcaa 2820 
gagctcccat gtgcttgttt tcctctcttc ttgctttctt ccatttgctc tcttctctgc 2880 
ccaccgtggt gtgtctttct cttcccaggt tggaacaggc cttggagaag ggtacaatat 2940 
aaatattgcc tggacaggtg gccttgatcc tcccatggga gatgttgagt accttgaagc 3000 
attcaggacc atcgtgaagc ctgtggccaa agagtttgat ccagacatgg tcttagtatc 3 060 
tgctggattt gatgcattgg aaggccacac ccctcctcta ggagggtaca aagtgacggc 3120 
aaaatgtttt ggtcatttga cgaagcaatt gatgacattg gctgatggac gtgtggtgtt 3180 
ggctctagaa ggaggacatg atctcacagc catctgtgat gcatcagaag cctgtgtaaa 3240 
tgcccttcta ggaaatgagc tggagccact tgcagaagat attctccacc aaagcccgaa 3300 
tatgaatgct gttatttctt tacagaagat cattgaaatt caaagtatgt ctttaaagtt 3360 
ctcttaa 3367 

<210> 8 
<211> 835 
<212> PRT 

<213> Homo sapiens 



<400> 8 



Met 


His 


Ser 


Met 


He 


Ser 


Ser 


Val 


Asp 


Val 


Lys 


Ser 


Glu 


Val 


Pro 


Val 


1 








5 










10 










15 




Gly 


Leu 


Glu 


Pro 
20 


lie 


Ser 


Pro 


Leu 


Asp 
25 


Leu 


Arg 


Thr 


Asp 


Leu 
30 


Arg 


Met 


Met 


Met 


Pro 
35 


Val 


Val 


Asp 


Pro 


Val 
40 


Val. 


Arg 


Glu 


Lys 


Gin 
45 


Leu 


Gin 


Gin 


Glu 


Leu 


Leu 


Leu 


lie 


Gin 


Gin 


Gin 


Gin 


Gin 


lie 


Gin 


Lys 


Gin 


Leu 


Leu 




50 










55 










60 








He 


Ala 


Glu 


Phe 


Gin 


Lys 


Gin 


His 


Glu 


Asn 


Leu 


Thr 


Arg 


Gin 


His 


Gin 


65 










70 










75 








80 


Ala 


Gin 


Leu 


Gin 


Glu 


His 


lie 


Lys 


Glu 


Leu 


Leu 


Ala 


Xle 


Lys 


Gin 


Gin 










85 










90 








95 




Gin 


Glu 


Leu 


Leu 
100 


Glu 


Lys 


Glu 


Gin 


Lys 
105 


Leu 


Glu 


Gin 


Gin 


Arg 
110 


Gin 


Glu 


Gin 


Glu 


Val 
115 


Glu 


Arg 


His 


Arg 


Arg 
120 


Glu 


Gin 


Gin 


Leu 


Pro 
125 


Pro 


Leu 


Arg 
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Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys 

130 135 i4 0 

Gin Lys Leu Gin Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr 

155 160 
Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp 

165 170 175 

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gin Ser Ser Pro Pro Leu 

180 185 190 

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gin Asp 

195 200 205 

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser 
„ 210 215 220 

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly 

230 235 240 

Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala 

245 250 255 

Glu Gin Met Val Ser Gin Gin Arg lie Leu He His Glu Asp Ser Met 

250 265 270 

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn He Thr Leu 

275 280 285 

Gly Leu Pro Ala Val Pro Ser Gin Leu Asn Ala Ser Asn Ser Leu Lys 

290 295 300 

Glu Lys Gin Lys Cys Glu Thr Gin Thr Leu Arg Gin Gly Val Pro Leu 

310 315 
Pro Gly Gin Tyr Gly Gly Ser He Pro Ala Ser Ser Ser His Pro His 

325 330 335 

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gin Ala Leu Leu 

340 345 350 

Gin His Leu Leu Leu Lys Glu Gin Met Arg Gin Gin Lys Leu Leu Val 

355 360 3 6 5 

Ala Gly Gly Val Pro Leu His Pro Gin Ser Pro Leu Ala Thr Lys Glu 

370 375 380 

Arg He Ser Pro Gly He Arg Gly Thr His Lys Leu Pro Arg His Arg 
t 390 395 " 4Q0 

Pro Leu Asn Arg Thr Gin Ser Ala Pro Leu Pro Gin Ser Thr Leu Ala 

405 410 415 

Gin Leu Val He Gin Gin Gin His Gin Gin Phe Leu Glu Lys Gin Lys 

42 0 425 43 0 

Gin Tyr Gin Gin Gin He His Met Asn Lys Leu Leu Ser Lys Ser He 

435 440 445 

Glu Gin Leu Lys Gin Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu 

4 ?° 455 460 

Leu Gin Gly Asp Gin Ala Met Gin Glu Asp Arg Ala Pro Ser Ser Gly 
> „ 470 4 75 480 

Asn Ser Thr Arg Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly 

485 . 490 495 

Gin Val Gly Ala Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu 

500 505 510 

Asp Ala Gin He Gin Glu Met Glu Ser Gly Glu Gin Ala Ala Phe Met 

515 520 525 

Gin Gin Pro Phe Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg 

535 540 
Gin Ala Pro Leu Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg 

550 555 
Leu Val Ser Arg Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His 

r, 565 570 575 

Pro Ala Met Asp Arg Pro Leu Gin Pro Gly Ser Ala Thr Gly He Ala 

580 585 590 

Tyr Asp Pro Leu Met Leu Lys His Gin Cys Val Cys Gly Asn Ser Thr 

595 600 605 

Thr His Pro Glu His Ala Gly Arg He Gin Ser He Trp Ser Arg Leu 
510 615 620 
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Gin 


Glu 


Thr 


Gly Leu 


Leu Asn Lys Cys 


Glu 


Arg 


He Gin Gly Arg Lys 


625 








630 




635 


640 


Ala 


Ser 


Leu 


Glu Glu 


He Gin Leu Val 


His 


Ser 


Glu His His Ser Leu 








645 




650 




655 


Leu 


Tyr 


Gly 


Thr Asn 


Pro Leu Asp Gly 


Gin 


Lys 


Leu Asp Pro Arg He 








660 


665 






670 


Leu 


Leu 


Gly 


Asp Asp 


Ser Gin Lys Phe 


Phe 


Ser 


Ser Leu Pro Cys Gly 






675 




680 






685 


Gly 


Leu 


Gly 


Val Asp 


Ser Asp Thr He 


Trp 


Asn 


Glu Leu His Ser Ser 




690 






695 






700 


Gly 


Ala 


Ala 


Arg Met 


Ala Val Gly Cys 


Val 


He 


Glu Leu Ala Ser Lys 


705 








710 




715 


720 


Val 


"Ala 


Ser 


Gly Glu 


Leu Lys Asn Gly 


Phe 


Ala 


Val Val Arg Pro Pro 








725 




730 




735 


Gly 


His 


His 


Ala Glu 


Glu Ser Thr Ala 


Met 


Gly 


Phe Cys Phe Phe Asn 








740 


745 






750 


Ser 


Val 


Ala 


lie Thr 


Ala Lys Tyr Leu 


Arg 


Asp 


Gin Leu Asn He Ser 






755 




760 






765 


Lys 


Xle 


Leu 


He Val 


Asp Leu Asp Val 


His 


His 


Gly Asn Gly Thr Gin 




770 






775 






780 


Gin 


Ala 


Phe 


Tyr Ala 


Asp Pro Ser lie 


Leu 


Tyr 


He Ser Leu His Arg 


785 








790 




795 


800 


Tyr 


Asp 


Glu 


Gly Asn 


Phe Phe Pro Gly 


Ser 


Gly 


Ala Pro Asn Glu Val 








805 




810 




815 


Arg 


Phe 


He 


Ser Leu 


Glu Pro His Phe 


Tyr 


Leu 


Tyr Leu Ser Gly Asn 








820 


825 






830 


Cys 


lie 


Ala 
















835 













<210> 9 

<211> 1791 

<212> DKTA 

<213> Homo sapiens 

<400> 9 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 3 00 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cc tcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
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^™=? gaag aca ^ c ^ cc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc ag£ggaca?t 1680 
Itllltn 9 Ctca ^ cca Waaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
gtaataggca aagatttagc tccaggattt gtaattaaag tcattatctg a 1791 

<210> 10 
<211> 546 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met His Ser Met lie Ser Ser Val Asp Val Lys Ser Glu Val Pro Val 

Gly Leu Glu Pro He Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 

20 25 *an 

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gin Leu Gin Gin 

35 40 45 

Glu Leu Leu Leu lie Gin Gin Gin Gin Gin He Gin Lys Gin Leu Leu 

su 55 50 

lie Ala Glu Phe Gin Lys Gin His Glu Asn Leu Thr Arg Gin His Gin 

70 75 
Ala Gin Leu Gin Glu His lie Lys Glu Leu Leu Ala He Lys Gin Gin 

85 90 95 

Gin Glu Leu Leu Glu Lys Glu Gin Lys Leu Glu Gin Gin Arg Gin Glu 

0 105 no 

Gin Glu Val Glu Arg His Arg Arg Glu Gin Gin Leu Pro Pro Leu Arg 

115 120 125 

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys 

135 140 
Gin Lys Leu Gin Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr 
150 155 160 

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp 

165 170 175 

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gin Ser Ser Pro Pro Leu 

180 18 5 190 

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gin Asp 
», T 195 200 205 

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser 

210 215 220 

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly 

230 235 
Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala 

245 250 255 

Glu Gin Met Val Ser Gin Gin Arg lie Leu lie His Glu Asp Ser Met 

260 265 270 

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn He Thr Leu 

2/5 280 285 

Gly Leu Pro Ala Val Pro Ser Gin Leu Asn Ala Ser Asn Ser Leu Lys 
„, 290 295 300 

Glu Lys Gin Lys Cys Glu Thr Gin Thr Leu Arg Gin Gly Val Pro Leu 
31° 315 -ion 

Pro Gly Gin Tyr Gly Gly Ser He Pro Ala Ser Ser Ser His Pro His 

325 330 335 

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gin Ala Leu Leu 
„ n . 340 345 350 

Gin His Leu Leu Leu Lys Glu Gin Met Arg Gin Gin Lys Leu Leu Val 

355 36O 365 

Ala Gly Gly Val Pro Leu His Pro Gin Ser Pro Leu Ala Thr Lys Glu 

/u 375 38O 

Arg He Ser Pro Gly He Arg Gly Thr His Lys Leu Pro Arg His Arg 
_ _ 390 395 40Q 

Pro Leu Asn Arg Thr Gin Ser Ala Pro Leu Pro Gin Ser Thr Leu Ala 
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An r 




O-LXl 


Lieu 


\7s»1 Tla Pin 

vai -Lie kjxn 


vjIH 






ao n 






Tyr 


o±n oJLIl o-LIl 


lie 






A 1 R 
4 J j 




pi i, 
VjjIU 


bin 


Leu Lys Gin 


Pro 




/i c n 






Leu 


Gin 


Gly Asp Gin 


Ala 


465 






470 


Asn 


Ser 


Thr Arg Ser 


Asp 






485 




Gin 


Val 


Gly Ala Val 


Lys 






500 




Asp 


Ala 


Gin lie Gin 


Glu 






515 




Gin 


Gin 


Val lie Gly 


Lys 




530 






lie 


lie 






545 









<210> 11 

<211> 590 

<212> PRT 

<213> Homo sapiens 



<400> 11 



Met 


His 


Ser 


Met 


lie 


Ser 


1 








5 




Gly 


Leu 


Glu 


Pro 


lie 


Ser 








20 






Met 


Met 


Pro 


Val 


Val 


Asp 






35 








Glu 


Leu 


Leu 


Leu 


He 


Gin 




50 










lie 


Ala 


Glu 


Phe 


Gin 


Lys 


65 










70 


Ala 


Gin 


Leu 


Gin 


Glu 


His 










85 




Gin 


Glu 


Leu 


Leu 


Glu 


Lys 








100 






Gin 


Glu 


Val 


Glu 


Arg 


His 






115 








Gly 


Lys 


Asp 


Arg 


Gly 


Arg 




130 










Gin 


Lys 


Leu 


Gin 


Glu 


Phe 


145 










150 


Pro 


Thr 


Asn 


Gly 


Lys 


Asn 










165 




Tyr 


Thr 


Ala 


Ala 


His 


His 








180 






Ser 


Gly 


Thr 


Ser 


Pro 


Ser 






195 








Ala 


Lys 


Asp 


Asp 


Phe 


Pro 




210 










Lys 


Val 


Arg 


Ser 


Arg 


Leu 


225 










230 


Pro 


Leu 


Leu 


Arg 


Arg 


Lys 










245 




Arg 


Met 


Phe 


Glu 


Val 


Thr 








260 






Ser 


Gly 


Pro 


Ser 


Ser 


Pro 



15/25 
410 



Gin 




Gin 


Gin 


Phe 


Leu 






425 








His 


Met 


Asn Lys 


Leu 


Leu 




440 










Gly 


Ser 


His 


Leu 


Glu 


Glu 


455 










460 


Met 


Gin 


Glu Asp 


Arg Ala 










475 




Ser 


Ser 


Ala 


Cys 


Val 


Asp 








490 






Val 


Lys 


Glu 


Glu 


Pro 


Val 






505 








Met 


Glu 


Ser Gly 


Glu 


Gin 




u 










Asp 


Leu 


Ala 


Pro 


Gly Phe 












540 


Ser 


vai 


Asp Val 


Lys 


Ser 








10 






Pro 


Leu 


Asp 


Leu 


Arg 


Thr 






25 








Pro 


vai 


Val 


Arg 


Glu 


Lys 




a n 










Cj±n 


bin 


Gin 


Gin 


He 


Gin 


R £ 

JJ 










60 


bin 


HIS 


Glu 


Asn 


Leu 


Thr 










75 




lie 


Lys 


Glu 


Leu 


Leu 


Ala 








90 






blu 


vjin 


Lys 


Leu 


Glu 


Gin 






105 








Arg 


Arg 


Glu 


Gin 


Gin 


Leu 




1<2 u 










blli 


Arg 


Ala 


Val 


Ala 


Ser 


1J J 










140 


Jjcu 


ucu 


Ser 


Lys 


Ser 


Ala 










155 




raj. t> 


Cov- 
oci 


Val 


Ser 


Arg 


His 








170 






Thr 


Ser 


Leu 


Asp 


Gin 


Ser 






185 








Tyr 


Lys 


Tyr 


Thr 


Leu 


Pro 




200 










Leu 


Arg 


Lys 


Thr 


Ala 


Ser 


215 










220 


Lys 


Gin 


Lys 


Val 


Ala 


Glu 










235 




Asp 


Gly 


Asn 


Val 


Val 


Thr 








250 






Glu 


Ser 


Ser 


Val 


Ser 


Ser 






265 








Asn 


Asn 


Gly Pro 


Thr Gly 



PCT/US02/19051 







A1 K 




Glu Lys 


tain 


Lys 




430 






Ser 


Lys 


Ser 


He 


445 








Ala 


Glu 


Glu 


Glu 


Pro 


Ser 


Ser 


Gly 








480 


Asp 


Thr 


Leu 


Gly 






495 




Asp 


Ser 


Asp 


Glu 




510 






Ala 


Ala 


Phe 


Met 


525 








Val 


lie 


Lys 


Val 



Glu Val 


Pro 


Val 






15 




Asp Leu 


Arg 


Met 




30 






Gin 


Leu 


Gin 


Gin 


45 








Lys 


Gin 


Leu 


Leu 


Arg Gin 


His 


Gin 








80 


lie 


Lys 


Gin 


Gin 






95 




Gin 


Arg 


Gin 


Glu 




110 






Pro 


Pro 


Leu 


Arg 


125 








Thr 


Glu 


Val 


Lys 


Thr. 


Lys 


Asp 


Thr 








160 


Pro 


Lys 


Leu 


Trp 






175 




Ser 


Pro 


Pro 


Leu 




190 






Gly Ala 


Gin 


Asp 


205 








Glu 


Pro 


Asn 


Leu 


Arg Arg 


Ser 


Ser 








240 


Ser 


Phe 


Lys 


Lys 






255 




Ser 


Ser 


Pro 


Gly 




270 






Ser 


Val 


Thr 


Glu 
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275 280 28S 

Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala Glu Gin Met Val 

Ser Gin Gin Arg He Leu lie His Glu Asp Ser Met Asn Leu Leu Ser 

Leu Tyr Thr Ser. Pro Ser Leu Pro Asn lie Tnr Leu Gly Leu Pro 2 

J ZD 330 

Val Pro Ser Gin Leu Asn Ala Ser Asn Ser Leu Lys Glu Lys Gin Lys 



Cys Glu Thr Gin Thr Leu Arg Gin cly Val Pro Leu Pro Gly Gin Tyr 

Leu 
Leu 
Gly 

Pro Leu His Pro Gin Ser Pro Leu Ala Thr Lys Glu Arg lie Ser Pro 



355 360 

Val 
Gin 

u 395 >i nn 

Leu Lys Glu Gin Met Arg Gin Gin Lys Leu Leu Val Ala Gly Gly Val 
405 410 * J * 



Gly Gly Ser lie Pro Ala Ser Ser Ser His Pro His Val Thr Leu Glu 

„„ v J/0 -tan 

Gly Lys Pro Pro Asn Ser - 

38b 390 395 — 



385 ^ Pr ° ASn !E SSr His Gln Ala Gin His Leu Leu 



420 425 430 

Thr Gin Ser Ala Pro Leu Pro Gin Ser Thr Leu Ala Gin Leu Val He 



Gly lie Arg Gly Thr His Lys Leu Pro Arg His Arg Pro Leu Asn Arg 



450 455 460 

Gin Gin Gin His Gin Gin Phe Leu Glu Lys Gin Lys Gin Tyr Gin Gin 

Gin lie His Met Asn Lys Leu Leu Ser Lys Ser He Glu Gin Leu tys 

490 AQt? 
Gin Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu Leu Gin Gly Asp 

Gin Ala Met Gin Glu Asp Arg Ala Pro Ser Ser Gly Asn Ser Thr Arg 

Ser Asp Ser Ser Ala Cys val Asp Asp Thr Leu Gly Gin Val Gly Ala 

Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp gIu Asp Ala Gin He 

Ala 
570 



550 55c ' ~" 

Gin Glu Met Glu Ser Gly Glu Gin Ala Ala Phe Met Gin Gin Val 21 

Gly Lys Asp Leu Ala Pro Gly Phe Val lie Lys Val He lie ^ 

yu 585 59 0 



<210> 12 

<211> 1084 

<212> PRT 

<213> Homo sapiens 

<400> 12 



Met ser Ser Gin Ser His Pro Asp Gly Leu Ser Gly Arg Asp Gin Pro 

Val Glu Leu Leu Asn Pro Ala Arg Val Asn His Met Pro Ser Thr Val 
zu 25 



5 ' ~ 10 15 

Asp Val Ala Thr Ala Leu Pro Leu £» Val Ala Pro Ser Ma Val Pro 

40 AC 

Met Asp Leu Arg Leu Asp His Gin Phe Ser Leu Pro Val Ala Glu Pro 
Ala Leu Arg Glu Gin Gin Leu Gin Gin Glu Leu Leu Ala Leu Lys Gin 

70 7 5 80 



Lys Gin Gin He Gin Arg Gin He Leu He Ala Glu Phe Gin Arg gL 

Ala Gin Leu His Glu 
Lys Gin Gin Gin Glu Met Leu Ala Met Lys His Gin Gin S Leu Leu 



oj 90 

His Glu Gin Leu Ser Arg Gin His Glu Ala Gin Leu His Glu His He 
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115 










120 








125 








Glu His 


Gin 


Arg 


Lys 


Leu Glu Arg 


His Arg " Gin 


Glu Gin 


Glu 


Leu 


Glu 


130 










135 










140 








Lys Gin 


His 


Arg 


Glu Gin Lys 


Leu 


Gin 


Gin 


Leu 


Lys Asn 


Lys 


Glu 


Lys 


145 








150 










155 








160 


Gly Lys Glu 


Ser 


Ala 


Val 


Ala 


Ser 


Thr 


Glu Val 


Lys Met 


Lys 


Leu 


Gin 








165 










170 








175 




Glu Phe.Val 


Leu 


Asn Lys 


Lys 


Lys 


Ala 


Leu Ala 


His Arg 


Asn 


Leu 


Asn 






180 










185 








190 






His Cys 


He 


Ser 


Ser Asp 


Pro 


Arg 


Tyr Trp Tyr 


Gly Lys 


Thr 


Gin 


His 




195 










200 








205 








Ser Ser 


Leu 


Asp 


Gin 


Ser 


Ser 


Pro 


Pro 


Gin 


Ser 


Gly Val 


Ser 


Thr 


Ser 


210 










215 










220 








Tyr Asn 


His 


Pro 


Val Leu Gly Met 


Tyr Asp Ala 


Lys Asp 


Asp 


Phe 


Pro 


225 








230 










235 








240 


Leu Arg 


Lys 


Thr 


Ala 


Ser 


Glu 


Pro 


Asn Leu Lys 


Leu Arg 


Ser 


Arg 


Leu 








245 










250 








255 




Lys Gin 


Lys 


Val 


Ala 


Glu 


Arg 


Arg 


Ser 


Ser 


Pro 


Leu Leu 


Arg 


Arg 


Lys 






260 










265 








270 






Asp Gly 


Pro 


Val 


Val 


Thr 


Ala 


Leu 


Lys 


Lys 


Arg 


Pro Leu 


Asp 


Val 


Thr 




275 










280 








285 








Asp Ser 


Ala 


Cys 


Ser 


Ser 


Ala 


Pro 


Gly 


Ser Gly 


Pro Ser 


Ser 


Pro 


Asn 


290 










295 










300 








Asn Ser 


Ser 


Gly 


Ser 


Val 


Ser 


Ala 


Glu 


Asn 


Gly 


He Ala 


Pro 


Ala 


Val 


305 








310 










315 








320 


Pro Ser 


He 


Pro 


Ala 


Glu 


Thr 


Ser 


Leu 


Ala 


His 


Arg Leu 


Val 


Ala 


Arg 








325 










330 








335 




Glu Gly Ser 


Ala 


Ala 


Pro 


Leu 


Pro 


Leu Tyr Thr 


Ser Pro 


Ser 


Leu 


Pro 






340 










345 








350 






Asn He 


Thr 


Leu 


Gly Leu 


Pro 


Ala 


Thr Gly Pro 


Ser Ala 


Gly 


Thr 


Ala 




355 










360 








365 








Gly Gin Gin 


Asp 


Thr Glu Arg Leu 


Thr 


Leu 


Pro 


Ala Leu 


Gin 


Gin 


Arg 


370 










375 










380 








Leu Ser 


Leu 


Phe 


Pro Gly Thr His 


Leu 


Thr 


Pro 


Tyr Leu 


Ser 


Thr 


Ser 


385 








390 










395 








400 


Pro Leu Glu 'Arg 


Asp Gly Gly Ala 


Ala 


His 


Ser 


Pro Leu 


Leu 


Gin 


His 








405 










410 








415 




Met Val 


Leu 


Leu 


Glu 


Gin 


Pro 


Pro 


Ala 


Gin Ala 


Pro Leu 


Val 


Thr 


Gly 






420 










425 








430 




Leu Gly Ala 


Leu 


Pro 


Leu 


His 


Ala 


Gin 


Ser 


Leu 


Val Gly 


Ala 


Asp 


Arg 




435 










440 








445 






Val Ser 


Pro 


Ser 


He 


His 


Lys 


Leu 


Arg 


Gin 


His 


Arg Pro 


Leu 


Gly 


Arg 


450 










455 










460 








Thr Gin 


Ser 


Ala 


Pro 


Leu 


Pro 


Gin 


Asn 


Ala 


Gin 


Ala Leu 


Gin 


His 


Leu 


465 








470 










475 








480 


Val He 


Gin 


Gin 


Gin 


His 


Gin 


Gin 


Phe 


Leu 


Glu 


Lys His 


Lys 


Gin 


Gin 








485 










490 








495 




Phe Gin 


Gin 


Gin 


Gin 


Leu 


Gin 


Met 


Asn 


Lys 


He 


He Pro 


Lys 


Pro 


Ser 






500 










505 








510 






Glu Pro 


Ala 


Arg 


Gin 


Pro 


Glu 


Ser 


His 


Pro 


Glu 


Glu Thr 


Glu 


Glu 


Glu 




515 










520 








525 








Leu Arg 


Glu 


His 


Gin 


Ala 


Leu 


Leu 


Asp 


Glu 


Pro 


Tyr Leu 


Asp 


Arg 


Leu 


530 










535 










540 








Pro Gly Gin 


Lys 


Glu 


Ala 


His 


Ala 


Gin 


Ala 


Gly 


Val Gin 


Val 


Lys 


Gin 


545 








550 










555 








560 


Glu Pro 


He 


Glu 


Ser Asp Glu Glu 


Glu 


Ala 


Glu 


Pro Pro 


Arg 


Glu 


Val 








565 










570 








575 




Glu Pro 


Gly 


Gin 


Arg 


Gin 


Pro 


Ser 


Glu 


Gin 


Glu 


Leu Leu 


Phe 


Arg 


Gin 






580 










585 








590 






Gin Ala 


Leu 


Leu 


Leu 


Glu 


Gin 


Gin 


Arg 


He 


His 


Gin Leu 


Arg 


Asn 


Tyr 




595 










600 








605 








Gin Ala 


Ser 


Met 


Glu Ala Ala Gly 


He 


Pro 


Val 


Ser Phe 


Gly 


Gly 


His 
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610 615 62Q 

Arg Pro Leu Ser Arg Ala Gin Ser Ser Pro Ala Ser Ala Thr Phe Pro 

630 635 640 

Val Ser Val Gin Glu Pro Pro Thr Lys Pro Arg Phe Thr Thr Gly Leu 

645 650 655 

Val Tyr Asp Thr Leu Met Leu Lys His Gin Cys Thr Cys Gly Ser Ser 

t 660 665 67Q 

Ser Ser His Pro Glu His Ala Gly Arg He Gin Ser lie Trp Ser Arg 

675 680 685 

Leu Gin Glu Thr Gly Leu Arg Gly Lys Cys Glu Cys lie Arg Gly Arg 

Lys Ala Thr Leu Glu Glu Leu Gin Thr Val His Ser Glu Ala His Thr 

710 715 720 

Leu Leu Tyr Gly Thr Asn Pro Leu Asn Arg Gin Lys Leu Asp Ser Lys 

725 730 735 

Lys Leu Leu Gly Ser Leu Ala Ser Val Phe Val Arg Leu Pro Cys Gly 

740 745 750 

Gly Val Gly Val Asp Ser Asp Thr He Trp Asn Glu Val His Ser Ala 
755 7 60 765 

GlY 7^n Ala ^ g LSU Ala Val Gly Cys Val Val Glu Leu Val Phe Lys 

770 775 780 

Val Ala Thr Gly Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro 

790 795 800 

Gly His His Ala Glu Glu Ser Thr Pro Met Gly Phe Cys Tyr Phe Asn 

805 810 sis 

Ser Val Ala Val Ala Ala Lys Leu Leu Gin Gin Arg Leu Ser Val Ser 

820 825 830 

Lys He Leu He Val Asp Trp Asp Val His His Gly Asn Gly Thr Gin 

835 840 845 

Gin Ala Phe Tyr Ser Asp Pro Ser Val Leu Tyr Met Ser Leu His Arg 

850 855 8 60 

Tyr Asp Asp Gly Asn Phe Phe Pro Gly Ser Gly Ala Pro Asp Glu Val 
n 870 875 880 

Gly Thr Gly Pro Gly Val Gly Phe Asn Val Asn Met Ala Phe Thr Glv 
885 8 9o 895 y 

Gly Leu Asp Pro Pro Met Gly Asp Ala Glu Tyr Leu Ala Ala Phe Arg 

. 900 905 910 

Thr Val Val Met Pro He Ala Ser Glu Phe Ala Pro Asp Val Val Leu 

915 920 925 

Val Ser Ser Gly Phe Asp Ala Val Glu Gly His Pro Thr Pro Leu Gly 

930 935 940 

Gly Tyr Asn Leu Ser Ala Arg Cys Phe Gly Tyr Leu Thr Lys Gin Leu 

950 955 960 

Met Gly Leu Ala Gly Gly Arg He Val Leu Ala Leu Glu Gly Gly His 

965 970 975 

Asp Leu Thr Ala He Cys Asp Ala Ser Glu Ala Cys Val Ser Ala Leu 

980 985 990 

Leu Gly Asn Glu Leu Asp Pro Leu Pro Glu Lys Val Leu Gin Gin Arg 
995 1000 1005 

^?n Ala Asn Ala Val Arg Ser Met Glu Val Met Glu He His 

1010 1015 1020 

Ser Lys Tyr Trp Arg Cys Leu Gin Arg Thr Thr Ser Thr Ala Gly Arg 

1030 1035 1040 

Ser Leu He Glu Ala Gin Thr Cys Glu Asn Glu Glu Ala Glu Thr Val 

mu n 1045 1050 1055 

Thr Ala Met Ala Ser Leu Ser Val Gly Val Lys Pro Ala Glu Lys Arg 

1060 1065 1070 

Pro Asp Glu Glu Pro Met Glu Glu Glu Pro Pro Leu 
1075 1080 



<210> 13 



WO 02/102984 



PCT/US02/19051 



19/25 



<211> 3550 

<212> DNA 

<213> Homo sapiens 

<400> 13 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840 
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900 
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960 
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020 
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080 
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740 
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800 
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860 
tttatgcaac aggtaatagg caaagattta gctccaggat ttgtaattaa agtcattatc 1920 
tgacctttcc tggaacccac gcacacacgt gcgctctctg tgcgccaagc tccgctggct 1980 
gcggttggca tggatggatt agagaaacac cgtctcgtct ccaggactca ctcttcccct 2040 
gctgcctctg ttttacctca cccagcaatg gaccgccccc tccagcctgg ctctgcaact 2100 
ggaattgcct atgacccctt gatgctgaaa caccagtgcg tttgtggcaa ttccaccacc 2160 
caccctgagc atgctggacg aatacagagt atctggtcac gactgcaaga aactgggctg 2220 
ctaaataaat gtgagcgaat tcaaggtcga aaagccagcc tggaggaaat acagcttgtt 2280 
cattctgaac atcactcact gttgtatggc accaaccccc tggacggaca gaagctggac 2340 
cccaggatac tcctaggtga tgactctcaa aagttttttt cctcattacc ttgtggtgga 2400 
cttggggtgg acagtgacac catttggaat gagctacact cgtccggtgc tgcacgcatg 2460 
gctgttggct gtgtcatcga gctggcttcc aaagtggcct caggagagct gaagaatggg 2520 
tttgctgttg tgaggccccc tggccatcac gctgaagaat ccacagccat ggggttctgc 2580 
ttttttaatt cagttgcaat taccgccaaa tacttgagag accaactaaa tataagcaag 2640 
atattgattg tagatctgga tgttcaccat ggaaacggta cccagcaggc cttttatgct 2700 
gaccccagca tcctgtacat ttcactccat cgctatgatg aagggaactt tttccctggc 2760 
agtggagccc caaatgaggt tcggtttatt tctttagagc cccactttta tttgtatctt 2820 
tcaggtaatt gcattgcatg attaccccta attttcttgt cctttgctgg tgttttaaat 2880 
tacacgagat tactgaattg tcccatggga ccaagaacca gtgcagaaca agtgcataac 2940 
ccagagcact gtttgtcagg gaaggttggg ctgatttgat gtgttgtttg atgtttattt 3 000 
caagagctcc catgtgcttg ttttcctctc ttcttgcttt cttccatttg ctctcttctc 3060 
tgcccaccgt ggtgtgtctt tctcttccca ggttggaaca ggccttggag aagggtacaa 3120 
tataaatatt gcctggacag gtggccttga tcctcccatg ggagatgttg agtaccttga 3180 
agcattcagg accatcgtga agcctgtggc caaagagttt gatccagaca tggtcttagt 3240 
atctgctgga tttgatgcat tggaaggcca cacccctcct ctaggagggt acaaagtgac 3300 
ggcaaaatgt tttggtcatt tgacgaagca attgatgaca ttggctgatg gacgtgtggt 3360 
gttggctcta gaaggaggac atgatctcac agccatctgt gatgcatcag aagcctgtgt 3420 
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«==5 9 !r CC £ ct f^aaatg agctggagcc acttgcagaa gatattctcc accaaagccc 3480 
S?c?ct5£ gCtgttattt cttta ^gaa gatcattgaa attcaaagta tgtcttLaa 3540 

3550 

<210> 14 

<211> 7699 . 

<212> DNA 

<213> Homo sapiens 

<400> 14 

^^»^ gCC at *: caggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
SSf?^ B gc t ggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaataEtg gctattggcc 240 
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300 
accgccatgt tgacattgat tattgactag ttattaatag taateaatta cggggtcatt 360 
clllrntT Ccatatatgg a gttccgcgt tacataactt acggtaaatg gcccjcctgg 420 
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480 
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtS 600 

» a ^ attat9C f°T aCatg acc " a ^gg agtttcctac t?ggcagtac 660 
tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720 
latttT^ ggtttgaCtc f^gggattt ccaagtctcc accccattga cgtcaatggg 780 
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840 
atat Jtntn tgggCggtag g^gtgtacgg tgggaggtct atataagcag agctcgttta 900 
allctatttn agaatt ^ aag f tgcggccg cagatctatc gatctgcagg atatcaccat 960 
ctcaccttta Si' tggatg * gaa gtcagaagtt cctgtgggcc tggagcccat 1020 
crahS™ gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080 
Scagct?c a a ataacfaa^ aggaattaCt ^"atccag cagcagcaac aaatccagaa 114? 
?^ ag " Ctg ata gcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 
acaacaact? ^l^t^ agcagaggca a gaacaggaa gtagagaggc atcgcagaga 1320 
aaaf^aaaa <WW°«*Sr« tagaggacga gaaagggcag tggcaagtac 1380 

aa a t^ aaa ? ^gaagcttc a agagttcct actgagtaaa tcagcaacga aagacactcc 1440 
ccacacafca ttaT^ cc f ga ^g ccatcccaag ctctggtaca cggctgccca 1500 
cacf a ? a ^ a ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560 
£r™ a ^ g ^agcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620 
cttactcaaa caall^T CCagg " aaa a <=agaaagtg gcagagagga gaagcagcfc leSO 
lllTr. f 09 = ggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740 
gccaactaaa lllltTT gCagttctcc ^tctggt cccag?tcac caaacLgg lloo 
^" a S ^"actg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860 
SataccSt gcattc * aat tcatgaagat tccatgaacc tgctaagtct 1920 

caatactS ttltn^ 9 ° CCaacattac ^ttggggctt cccgcagtgc catcccagct 1980 
tatSctr^ a ^ a ° tCa aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tactttaaan ™= gggCagt atggaggca 9 catcccggca tcttccagcc acccicatj? 2100 
aaai^™ ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160 
atctccct 33 a ^« agC aaaagc "ct tgtagctggt ggagttccct tacatcctca 2220 
ccaS™ f.*? * 9 aga 9 aatttc acctggcatt agaggtaccc acaaattgcc 2280 
attlatnl?? CCCCtgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340 
allnrlT^ caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400 
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460 

ctSgSSc aacaaoac? 9 agCttCaggg gga ~ a ggcg atgcaggaag aca£cgcc 25 2 S 
aatt™^ aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580 
frtnn 9g C& aggaggaacc afftflgacagt gatgaagatg ctcag^cca 2640 
cacacofona r^^ agC aggctgc 5 fct tatgcaacag cctttcctgg aacccacgca 2700 
aaaaclcco? r?2£ f 90 gCcaagct = c Setggctgcg gttggcatgg atggattaga 2760 
facaataaS tlllll ,° & ggaC * Cactc "cccctgct gcctctgttt tacctcaccc 2820 
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttaat 2880 
So aaaCaC = ag ^ gcgttt Stogeaatte caccacccac cctgagcatl ctggacglat 294? 
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaa?tca 3000 
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060 
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120 
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ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180 
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
ccatcacgct gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
actccatcgc tatgatgaag ggaacttttt ccctggcagt ggagccccaa atgaggttgg 3540 
aacaggcctt ggagaagggt acaatataaa tattgcctgg acaggtggcc ttgatcctcc 3600 
catgggagat gttgagtacc ttgaagcatt caggaccatc gtgaagcctg tggccaaaga 3660 
gtttgatcca gacatggtct tagtatctgc tggatttgat gcattggaag gccacacccc 3720 
tcctctagga gggtacaaag tgacggcaaa atgttttggt catttgacga agcaattgat 3780 
gacattggct gatggacgtg tggtgttggc tctagaagga ggacatgatc tcacagccat 3840 
ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga aatgagctgg . agccacttgc 3900 
agaagatatt ctccaccaaa gcccgaatat gaatgctgtt atttctttac agaagatcat 3960 
tgaaattcaa agtatgtctt taaagttctc tggatccggt accagattac aaggacgacg 4020 
atgacaagta gatcccgggt ggcatccctg tgacccctcc ccagtgcctc tcctggcctt 4080 
ggaagttgcc actccagtgc ccaccagcct tgtcctaata aaattaagtt gcatcatttt 4140 
gtctgactag gtgtcctcta taatattatg gggtggaggg gggtggtatg gagcaagggg 4200 
cccaagttgg gaagacaacc tgtagggcct gcggggtcta ttcgggaacc aagctggagt 4260 
gcagtggcac aatcttggct cactgcaatc tccgcctcct gggttcaagc gattctcctg 4320 
cctcagcctc ccgagttgtt gggattccag gcatgcatga ccaggctcag ctaatttttg 4380 
tttttttggt agagacgggg tttcaccata ttggccaggc tggtctccaa ctcctaatct 4440 
caggtgatct acccaccttg gcctcccaaa ttgctgggat tacaggcgtg aaccactgct 4500 
cccttccctg tccttctgat tttaaaataa ctataccagc aggaggacgt ccagacacag 4560 
cataggctac ctgccatggc ccaaccggtg ggacatttga gttgcttgct tggcactgtc 4620 
ctctcatgcg ttgggtccac tcagtagatg cctgttgaat tgggtacgcg gccagcttct 4680 
gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccagcag gcagaagtat 4740 
gcaaagcatg catctcaatt agtcagcaac caggtgtgga aaagtcccca ggctccccag 4800 
caggcagaag tatgcaaagc atgcatctca attagtcagc aaccatagtc ccgcccctaa 4860 
ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc catggctgac 4920 
taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta ttccagaagt 4980 
agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcctcgag gaactgaaaa 5040 
accagaaagt taattcccta tagtgagtcg tattaaattc gtaatcatgg tcatagctgt 5100 
ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 5160 
agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 5220 
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 5280 
cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 5340 
gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 5400 
ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 5460 
ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 5520 
atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 5580 
aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 5640 
gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc tcacgctgta 5700 
ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 5760 
ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 5820 
acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 5880 
gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat 5940 
ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 6000 
ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 6060 
gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 6120 
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 6180 
agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 6240 
ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 6300 
gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 6360 
catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 6420 
cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 6480 
cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 6540 
gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 6600 
tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 6660 
gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 6720 
tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 6780 
gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 6840 
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tattlatacf 2S gcgtcaatac SOTataatac cgcgccacat agcagaactt 6900 

tattaaStr rSS? T aaacgttctt cggggcgaaa actctcaagg atcttaccgc 6960 
ctttcaccaa T 9 5 aacccactc ^tgcacccaa ctgatcttca gcatctttta 7020 

cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 7080 
taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaloca 7140 
tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaSc nil 

tcc ^caca tttccccgaa aagtgccacc tglcgcgccc tgtagcggcg 1260 
cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc 7320 

atoaagctct " Ctt « Ctt Cctttctc ^ cacgttcgcc JgctSS llll 

gtcaagctct aaatcggggc atccctttag ggttccgatt tagtgcttta cggcacctcg 7440 
accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc t?atagacgg 75oS 
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaacta 7560 
gaacaacact caaccctatc tcggtctatt cttttgattt atLgggat? £tgccgat« ]\ll 

2S25S gagctgattt aacaaaaatt taacgcgaat ^5222 Veil 



<210> 15 

<211> 7303 

<212> DNA 

<213> Homo sapiens 

<400> 15 



cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttS Stlacgccca 120 
^ ^ c agtcacgacg ttgtaaaacg acggccagtg ccaagctglt ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240 

accgocatc? ESS"? at * ataatat ^acatttat attggctoa? gtccaacatt 300 
tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360 

9 ccatatatgg agttocgcgt tacataactt acggtaaatg gcccgcctag 420 
SS5S SKS CCC f tgaC£f tcaatagtga ^tatgttcc catagtaacg 480 
gcaataof 99 2%ZZ 3 acgtcaat ^ fftggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacaotaaa 600 
atctaco^ agCattat9C ccagtacatg accttacggg ag^ttcctac Sgclgtac 660 
cataaaSS SK^T tattaccat ^ Stgatgcggt tttggcagta caccaa^ggg 720 
Zatttnt??* ff^ttsactc acggggattt ccaagtctcc accccattga cgtcaatggj 780 

^ SScaccaaaa tcaacgggac tttocaaaat gtcgtaataa ccocgccccg 840 
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900 
gclcaatato SS* cagatctatc gatctgcagg aLtcaccat 96^ 

ctcaccttta al^t^t tggatg * gaa Stcagaagtt cctgtgggcc tggagccoat 1020 
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttat 1080 
acao gagaag Caattgca ^ c aggaattact tcttatccag cagcagcaac aaatcoagaa "52 
ttt g J.^t g atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 

acagcagcS SESSS a8ca ™ ca ^caggaa gtagagagg 3 atcgcagaja ill 
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaaatao nfln 
agaagtaaag cagaagcttc aagagttcct actgagtaaa ?cagcaac g a agacactcc llll 
ccacaca? 93 ttaaatf" CCgtgagccg ^atcccaag ctc?ggtaca cggctgccca "So 
cacattacca alloT^t™ 9 f Ctccacc ^ttagtgga acatctccat cctacaagta 1560 
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctaa 1620 
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcaglgagga gaagcagccc "sS 
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaaat 1740 
occaaota™ tcag ^ ag t a ocagttctcc aggctctggt ccLgftcac caaacSgg [HI 
g ^ aa S a agtgttact 9 aaaatgagac ttcggttttg ccccctaccc ctcatgccgl 1860 
t££cc££ ^ a r?^ aC gcattctaat tcatgaagat tccatgaacc tgctaagtlt 1920 
ca^octt™ till l gC ccaacattac cttggggctt cccgcagtgc catcccagct 1980 
tattoo^ 9 - aa ^ tCaCtC f aa 9 aaaa 9 ca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tacttta^ 9 ^?" Ca9t atggaggcag catcccggca tcttccagcc accc?catg? 2100 
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatfc ?ifin 

gtctScttg gcfSaaao ^'tt? tgtagc " ggt 9gagt g cc g t taStcctca llll 
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280 
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ccgtcacaga cccctgaacc gaacccagtc 
gctggtcatt caacagcaac accagcaatt 
gatccacatg aacaaactgc tttcgaaatc 
ccttgaggaa gcagaggaag agcttcaggg 
ctctagtggc aacagcacta ggagcgacag 
agttggggct gtgaaggtca aggaggaacc 
ggaaatggaa tctggggagc aggctgcttt 
cacacgtgcg ctctctgtgc gccaagctcc 
gaaacaccgt ctcgtctcca ggactcactc 
agcaatggac cgccccctcc agcctggctc 
gctgaaacac cagtgcgttt gtggcaattc 
acagagtatc tggtcacgac tgcaagaaac 
aggtcgaaaa gccagcctgg aggaaataca 
gtatggcacc aaccccctgg acggacagaa 
ctctcaaaag tttttttcct cattaccttg 
ttggaatgag ctacactcgt ccggtgctgc 
ggcttccaaa gtggcctcag gagagctgaa 
ccatcacgct gaagaatcca cagccatggg 
cgccaaatac ttgagagacc aactaaatat 
tcaccatgga aacggtaccc agcaggcctt 
actccatcgc tatgatgaag ggaacttttt 
gtttatttct ttagagcccc acttttattt 
cggtaccaga ttacaaggac gacgatgaca 
ctccccagtg cctctcctgg ccttggaagt 
aataaaatta agttgcatca ttttgtctga 
aggggggtgg tatggagcaa ggggcccaag 
tctattcggg aaccaagctg gagtgcagtg 
tcctgggttc aagcgattct cctgcctcag 
atgaccaggc tcagctaatt tttgtttttt 
aggctggtct ccaactccta atctcaggtg 
ggattacagg cgtgaaccac tgctcccttc 
cagcaggagg acgtccagac acagcatagg 
ttgagttgct tgcttggcac tgtcctctca 
gaattgggta cgcggccagc ttctgtggaa 
aggctcccca gcaggcagaa gtatgcaaag 
tggaaaagtc cccaggctcc ccagcaggca 
cagcaaccat agtcccgccc ctaactccgc 
cccattctcc gccccatggc tgactaattt 
cggcctctga gctattccag aagtagtgag 
aaaagctcct cgaggaactg aaaaaccaga 
attcgtaatc atggtcatag ctgtttcctg 
acaacatacg agccggaagc ataaagtgta 
tcacattaat tgcgttgcgc tcactgcccg 
tgcattaatg aatcggccaa cgcgcgggga 
cttcctcgct cactgactcg ctgcgctcgg 
actcaaaggc ggtaatacgg ttatccacag 
gagcaaaagg ccagcaaaag gccaggaacc 
ataggctccg cccccctgac gagcatcaca 
acccgacagg actataaaga taccaggcgt 
ctgttccgac cctgccgctt accggatacc 
cgctttctca atgctcacgc tgtaggtatc 
tgggctgtgt gcacgaaccc cccgttcagc 
gtcttgagtc caacccggta agacacgact 
ggattagcag agcgaggtat gtaggcggtg 
acggctacac tagaagaaca gtatttggta 
gaaaaagagt tggtagctct tgatccggca 
ttgtttgcaa gcagcagatt acgcgcagaa 
tttctacggg gtctgacgct cagtggaacg 
gattatcaaa aaggatcttc acctagatcc 
tctaaagtat atatgagtaa acttggtctg 
ctatctcagc gatctgtcta tttcgttcat 
taactacgat acgggagggc ttaccatctg 



tgcacctttg cctcagagca cgttggctca 2340 
cttggagaag cagaagcaat accagcagca 2400 
tattgaacaa ctgaagcaac caggcagtca 2460 
ggaccaggcg atgcaggaag acagagcgcc 2520 
cagtgcttgt gtggatgaca cactgggaca 2580 
agtggacagt gatgaagatg ctcagatcca 2 640 
tatgcaacag cctttcctgg aacccacgca 2700 
gctggctgcg gttggcatgg atggattaga 2760 
ttcccctgct gcctctgttt tacctcaccc 2820 
tgcaactgga attgcctatg accccttgat 2880 
caccacccac cctgagcatg ctggacgaat 2940 
tgggctgcta aataaatgtg agcgaattca 3000 
gcttgttcat tctgaacatc actcactgtt 3060 
gctggacccc aggatactcc taggtgatga 3120 
tggtggactt ggggtggaca gtgacaccat 3180 
acgcatggct gttggctgtg tcatcgagct 3240 
gaatgggttt gctgttgtga ggccccctgg 3300 
gttctgcttt tttaattcag ttgcaattac 3360 
aagcaagata ttgattgtag atctggatgt 3420 
ttatgctgac cccagcatcc tgtacatttc 3480 
ccctggcagt ggagccccaa atgaggttcg 3540 
gtatctttca ggtaattgca ttgcaggatc 3600 
agtagatccc gggtggcatc cctgtgaccc 3660 
tgccactcca gtgcccacca gccttgtcct 3720 
ctaggtgtcc tctataatat tatggggtgg 3780 
ttgggaagac aacctgtagg gcctgcgggg 3840 
gcacaatctt ggctcactgc aatctccgcc 3900 
cctcccgagt tgttgggatt ccaggcatgc 3960 
tggtagagac ggggtttcac catattggcc 402 0 
atctacccac cttggcctcc caaattgctg 4080 
cctgtccttc tgattttaaa ataactatac 4140 
ctacctgcca tggcccaacc ggtgggacat 4200 
tgcgttgggt ccactcagta gatgcctgtt 4260 
tgtgtgtcag ttagggtgtg gaaagtcccc 4320 
catgcatctc aattagtcag caaccaggtg 4380 
gaagtatgca aagcatgcat ctcaattagt 4440 
ccatcccgcc cctaactccg cccagttccg 4500 
tttttattta tgcagaggcc gaggccgcct 4560 
gaggcttttt tggaggccta ggcttttgca 4620 
aagttaattc cctatagtga gtcgtattaa 4680 
tgtgaaattg ttatccgctc acaattccac 4740 
aagcctgggg tgcctaatga gtgagctaac 4800 
ctttccagtc gggaaacctg tcgtgccagc 4860 
gaggcggttt gcgtattggg cgctcttccg 4920 
tcgttcggct gcggcgagcg gtatcagctc 4980 
aatcagggga taacgcagga aagaacatgt 5040 
gtaaaaaggc cgcgttgctg gcgtttttcc 5100 
aaaatcgacg ctcaagtcag aggtggcgaa 5160 
ttccccctgg aagctccctc gtgcgctctc 5220 
tgtccgcctt tctcccttcg ggaagcgtgg 5280 
tcagttcggt gtaggtcgtt cgctccaagc 5340 
ccgaccgctg cgccttatcc ggtaactatc 5400 
tatcgccact ggcagcagcc actggtaaca 5460 
ctacagagtt cttgaagtgg tggcctaact 5520 
tctgcgctct gctgaagcca gttaccttcg 5580 
aacaaaccac cgctggtagc ggtggttttt 5640 
aaaaaggatc tcaagaagat cctttgatct 5700 
aaaactcacg ttaagggatt ttggtcatga 57 60 
ttttaaatta aaaatgaagt tttaaatcaa 5820 
acagttacca atgcttaatc agtgaggcac 5880 
ccatagttgc ctgactcccc gtcgtgtaga 5940 
gccccagtgc tgcaatgata ccgcgagacc 6000 
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cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg accaaacaca finsn 
gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattjttgc caaaaaacta 5i?J 
gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgcca^ct acSgcatcg Sill 
SSEnT c P cgtc * fctt Wtatggctt cattcagctc cjgttcccaa cga?caaggc till 
5£™£ t CC ^ Catg Wgcaaaa aagcggttag ctccttcggt cctccgatcg 6300 
ctcttacta? S T g * agtg " at cactcatggt tatggcagca ctgcataatt 6360 
cattctaaaa ESS^ gtaagatgct "tctgtgac tggtgagtac tcaaccaagt 6420 
cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacaoaata fid fin 
ataccgcgcc acatagcaga actttaaaag tgctcatca? tggaaaacgt tcttcg?ggc etlo 
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcg^acac 660*0 
ggcaaXSc Sf^' ttta f ttca ccagcgtttc tggJtgagS S££5E SSSS 
^cctttttca 2SE?£ ff ggaa ^ aggg c ^cacggaa atgttgaata ctcatactct 6720 
ttaaat^ta? ^ agcatttatc agggttattg tctcatgagc ggatacatat 6780 

ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 6840 
tSecSSE gC ^ tgtagc ^STcattaa gcgcggcggg tgtggtggtt acgcgclgcg 69oS 
tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 6QfiO 

Pjrccggcttt ccccgtcaag ctctaaatcg gggcatccct ttajggt"c 7oIo 
gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacata 7080 
gtgggccatc gccctgatag acggtttttc gccctttgac gttggKcc acgttct?Ja ?JJS 
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc taScttttg nil 
atttataagg gattttgccg atttcggcct attggttaaa aaatga^ctg atttaaca2 726^ 
aatttaacgc gaattttaac aaaatattaa acgtttacaa ttt 9 Ub0 



7303 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 16 

ccatggaaac ggtacccagc aggc 24 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 17 

cactccatcg ctatgatgaa ggg 23 

<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 18 

agttcccttc atcatagcga tgg 23 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer used to amplify human DNA 
<400> 19 

aatgtacagg atgctggggt 

<210> 20 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 20 

cccttgtagc tggtggagtt ccctt 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 21 

tgtgtcatcg agctggcttc 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 22 

atcttctgca agtggctcca 



WO 02/102984 



PCT/US02/19051 



7/25 



Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His Pro Ala Met Asp 

615 620 
Arg Pro Leu Gin Pro Gly Ser Ala Thr Gly lie Ala Tyr Asp Pro Leu 

boO 635 ca 

Met Leu Lys His Gin Cys Val Cys Gly Asn Ser Thr Thr His Pro III 



645 650 655 

Thr 

Leu Leu Asn Lys Cys Glu Arg lie Gl"n Gly Arg Lys Ala Ser Leu Glu 



His Ala Gly Arg lie Gin Ser lie Trp Ser Arg Leu Gin Glu Thr Gly 

665 



Glu lie Gin Leu Val His Ser gTu His His Ser Leu Leu Tyr Gly Thr 

705 ASP Gly ?JJ LYS L6U As * Pro Leu Leu Gly Asp 

Asp ser Gin Lys Phe Phe Ser Ser Leu Pro Ss Gly Gly Leu Gly III 



730 



Asp Ser Asp Thr lie Trp Asn Glu Leu Ms Ser Ser Gly Ala III Arg 



740 745 



Met Ala Val Gly Cys Val lie Glu Leu Ala Ser Lys Val Ala Ser Gly 



760 



Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro Gly His His Ala 

780 

Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn Ser Val Ala lie 

795 Qnn 

Thr Ala Lys Tyr Leu Arg Asp Gin Leu Asn lie Ser Lys lie Leu III 

810 

Val Asp Leu Asp Val His His Gly Asn Gly Thr Gin Gin Ala Phe Tyr 

Ala Asp Pro Ser He Leu Tyr lie III Leu His Arg Tyr Asp Glu Gly 

Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val Arg Phe lie Ser 

8 5 5 860 
Gly 
875 



Jeu Glu Pro His Phe Tyr Leu Tyr Leu Ser Gly Asn Cys He Ala 



<210> 5 

<211> 3054 

<212> DNA 

<213> Homo sapiens 

<400> 5 



?Sg«??t SS.SS Ca3 ;"»»»» »«g,caccg gctggagcaa cttgcaggac 60 

2SK5 Ff F ™- "™ SEE 23K5S SS 
USES esse S5SSS ssss ~ 

cttcttatcc agcagcagca acaaatccag aagcagcttc taatannZrrZ ZZZ 

mmmmmmi 

SS aaetc?™? 0 ga.alaafaa «"Sga£ 660 

c"«tagS SSSS SSS SSJSS SSSc. toa ™ 

sssss ssss sss ? ™" ™« - 

Sill a«~ ilHH III =1 iiE is 

gc tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
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ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttgg aacaggcctt ggagaagggt acaatataaa tattgcctgg 2640 
acaggtggcc ttgatcctcc catgggagat gttgagtacc ttgaagcatt caggaccatc 2700 
gtgaagcctg tggccaaaga gtttgatcca gacatggtct tagtatctgc tggatttgat 2760 
gcattggaag gccacacccc tcctctagga gggtacaaag tgacggcaaa atgttttggt 2820 
catttgacga agcaattgat gacattggct gatggacgtg tggtgttggc tctagaagga 2880 
ggacatgatc tcacagccat ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga 2940 
aatgagctgg agccacttgc agaagatatt ctccaccaaa gcccgaatat gaatgctgtt 3000 
atttctttac agaagatcat tgaaattcaa agtatgtctt taaagttctc ttaa 3054 

<210> 6 

<211> 967 

<212> PRT 

<213> Homo sapiens 

<400> 6 



Met 


His 


Ser 


Met 


He 


Ser 


Ser 


Val 


Asp 


Val 


Lys 


Ser 


Glu 


Val 


Pro 


Val 


1 








5 










10 










15 




Gly 


Leu 


Glu 


Pro 


He 


Ser 


Pro 


Leu 


Asp 


Leu 


Arg 


Thr 


Asp 


Leu 


Arg 


Met 








20 










25 










30 






Met 


Met 


Pro 


Val 


Val 


Asp 


Pro 


Val 


Val 


Arg 


Glu 


Lys 


Gin 


Leu 


Gin 


Gin 






35 










40 










45 








Glu 


Leu 


Leu 


Leu 


He 


Gin 


Gin 


Gin 


Gin 


Gin 


He 


Gin 


Lys 


Gin 


Leu 


Leu 




50 










55 










60 










He 


Ala 


Glu 


Phe 


Gin 


Lys 


Gin 


His 


Glu 


Asn 


Leu 


Thr 


Arg 


Gin 


His 


Gin 


65 










70 










75 










80 


Ala 


Gin 


Leu 


Gin 


Glu 


His 


He 


Lys 


Glu 


Leu 


Leu 


Ala 


lie 


Lys 


Gin 


Gin 










85 










90 










95 




Gin 


Glu 


Leu 


Leu 


Glu 


Lys 


Glu 


Gin 


Lys 


Leu 


Glu 


Gin 


Gin 


Arg 


Gin 


Glu 








100 










105 










110 






Gin 


Glu 


Val 


Glu 


Arg 


His 


Arg 


Arg 


Glu 


Gin 


Gin 


Leu 


Pro 


Pro 


Leu 


Arg 






115 










120 










125 








Gly 


Lys 


Asp 


Arg 


Gly 


Arg 


Glu 


Arg 


Ala 


Val 


Ala 


Ser 


Thr 


Glu 


Val 


Lys 




130 










135 










140 










Gin 


Lys 


Leu 


Gin 


Glu 


Phe 


Leu 


Leu 


Ser 


Lys 


Ser 


Ala 


Thr 


Lys 


Asp 


Thr 


145 










150 










155 










160 


Pro 


Thr 


Asn 


Gly 


Lys 


Asn 


His 


Ser 


Val 


Ser 


Arg 


His 


Pro 


Lys 


Leu 


Trp 










165 










170 










175 




Tyr 


Thr 


Ala 


Ala 


His 


His 


Thr 


Ser 


Leu 


Asp 


Gin 


Ser 


Ser 


Pro 


Pro 


Leu 



180 185 190 
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Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gin Asp 

195 200 205 

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser 

210 215 220 

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly 

230 235 
Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala 

245 250 ?ss 

Glu Gin Met Val Ser Gin Gin Arg lie Leu He His Glu Asp Ser Met 

260 265 270 

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn lie Thr Leu 

275 280 285 

Gly Leu Pro Ala Val Pro Ser Gin Leu Asn Ala Ser Asn Ser Leu Lys 

295 300 
. Glu Lys Gin Lys Cys Glu Thr Gin Thr Leu Arg Gin Gly Val Pro Leu 

310 315 ~ 

Pro Gly Gin Tyr Gly Gly Ser He Pro Ala Ser Ser Ser His Pro His 

325 330 335 

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gin Ala Leu Leu 

345 35Q 
Gin His Leu Leu Leu Lys Glu Gin Met Arg Gin Gin Lys Leu Leu Val 

355 360 365 

Ala Gly Gly Val Pro Leu His Pro Gin Ser Pro Leu Ala Thr Lys Glu 

370 375 380 

Arg lie Ser Pro Gly lie Arg Gly Thr His Lys Leu Pro Arg His Arg 

395 /lc\ r\ 

Pro Leu Asn Arg Thr Gin Ser Ala Pro Leu Pro Gin Ser Thr Leu Ala 

~n T 405 4 10 415 

Gin Leu Val lie Gin Gin Gin His Gin Gin Phe Leu Glu Lys Gin Lys 

425 430 
Gin Tyr Gin Gin Gin He His Met Asn Lys Leu Leu Ser Lys Ser He 

440 445 
Glu Gin Leu Lys Gin Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu 

30 455 460 

Leu Gin Gly Asp Gin Ala Met Gin Glu Asp Arg Ala Pro Ser Ser Gly 

470 475 /on 

Asn Ser Thr Arg Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly 

485 490 495 

Gin Val Gly Ala Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu 

500 505 5io 

Asp Ala Gin He Gin Glu Met Glu Ser Gly Glu Gin Ala Ala Phe Met 

515 520 525 

Gin Gin Pro Phe Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg 

535 . 540 
Gin Ala Pro Leu Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg 

550 555 560 

Leu Val Ser Arg Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His 

565 570 575 

Pro Ala Met Asp Arg Pro Leu Gin Pro Gly Ser Ala Thr Gly He Ala 
_ 580 585 590 

Tyr Asp Pro Leu Met Leu Lys His Gin Cys Val Cys Gly Asn Ser Thr 

. 600 605 

Thr His Pro Glu His Ala Gly Arg He Gin Ser He Trp Ser Arg Leu 

615 620 
Gin Glu Thr Gly Leu Leu Asn Lys Cys Glu Arg He Gin Gly Arg Lys 

630 635 640 

Ala Ser Leu Glu Glu He Gin Leu Val His Ser Glu His His Ser Leu 

645 650 655 

Leu Tyr Gly Thr Asn Pro Leu Asp Gly Gin Lys Leu Asp Pro Arg He 

660 665 670 

Leu Leu Gly Asp Asp Ser Gin Lys Phe Phe Ser Ser Leu Pro Cvs Glv 
675 680 685 
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Glv 


Leu 


Glv 


Val 


Asn Ser Asd Thr 




\j j \j 






695 


Gly 


Ala 


Ala 




Met Ala Val Glv 


705 








710 


Val 


Ala 


Ser 


Gly 


Glu Leu Lvs Asn 










725 


Glv 


His 


His 


Ala 


Glu Glu Ser Thr 








740 






Val 


Ala 


lie 


Thr Ala Lvs Tvr 










760 




lie 


Leu 


lie 


Val Aqd Leu Aso 




770 






775 


Gin 


Ala 


Phe 


Tvr 

A y *• 


Ala Aso Pro Ser 


785 








790 




Asp 


Glu 


Glv 


Acjti Phe Phe Pro 

noil nic fjiic ti w 












Gly 


Thr 


(31 v 
vj -i-_y 


Leu 
















Leu 


Asp 


Jrx. L/ 


iri u rut; u oxy 






OJJ 






inr 


xxe 


vax 


Lys 


IrrO Val Ala XiyS 










O J J 


Val 


Ser 


Ala 


Glv 


Phe Asd Ala Leu 


865 








870 


Gly 


Tyr 


Lys 


Val 


Thr Ala Lys Cys 










885 


Met 


Thr 


Leu 


Ala 


Asp Gly Arg Val 








900 




Asp 


Leu 


Thr 


Ala 


lie Cys Asp Ala 






915 




920 


Leu 


Gly 


Asn 


Glu 


Leu Glu Pro Leu 




930 






935 


Pro 


Asn 


Met 


Asn 


Ala Val lie Ser 


945 








950 


Ser 


Met 


Ser 


Leu 


Lys Phe Ser 



965 



He 


Trp 


Asn Glu Leu His 


Ser 


Ser 






700 






Cys 


Val 


He Glu Leu Ala 


Ser 


Lys 






715 




720 


Gly 


Phe 


Ala Val Val Ara 


Pro 


Pro 




730 




735 




Ala Met 


Glv Phe Cvs Phe 


Phe 


Asn 


745 




750 






Leu Arg 


Asn Gin Leu Asn 

JJ^U 


He 


Ser 






765 






Val 


His 


His Glv Asn Glv 

ilAO VJ-i.Jr iivJl J. VJJ.JT 


Thr 


Gin 






780 






He 


Leu 


Tvr Tie Ser Leu 


His 


Arg 






795 




800 


Gly Ser 


Glv Ala Pro Asn 


Glu 


Val 




810 




815 




Asn 


He 


a ori Tie Al s Tm 

null J L. C iiXQ. 1 LJ^ 


Thr Gly 


825 




830 






Val 


Glu 


TH T.on f^l n Ala 
ijfi JJCU uxu nla 


Phe 


Arg 






O fx «J 






Glu 


Phe 


A csr% Prrt lor\ Mot" 


Val 


Leu 






r fin 






Glu Gly 


His Thr Pro Pro 


Leu Gly 






875 




880 


Phe Gly 


His Leu Thr Lys 


Gin 


Leu 




890 




895 




Val 


Leu 


Ala Leu Glu Gly 


Gly His 


905 




910 






Ser 


Glu 


Ala Cys Val Asn 


Ala 


Leu 






925 






Ala 


Glu 


Asp He Leu His 


Gin 


Ser 






940 






Leu 


Gin 


Lys He He Glu 


He 


Gin 






955 




960 



<210> 7 

<211> 3367 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 

tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 

ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 

aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 

aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 3 00 

cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 

cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 

ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 

caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 

gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 

ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 

cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 

ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 

gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 

cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 

ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 

tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 

cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
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acgcagacgc 
tcttccagcc 
ctcctgcagc 
ggagttccct 
agaggtaccc 
cctcagagca 
cagaagcaat 
ctgaagcaac 
atgcaggaag 
gtggatgaca 
gatgaagatg 
cctttcctgg 
gttggcatgg 
gcctctgttt 
attgcctatg 
cctgagcatg 
aataaatgtg 
tctgaacatc 
aggatactcc 

ggggtggaca 

gttggctgtg 
gctgttgtga 
tttaattcag 
ttgattgtag 
cccagcatcc 
ggagccccaa 
ggtaattgca 
acgagattac 
gagcactgtt 
gagctcccat 
ccaccgtggt 
aaatattgcc 
attcaggacc 
tgctggattt 
aaaatgtttt 
ggctctagaa 
tgcccttcta 
tatgaatgct 
ctcttaa 



ttaggcaagg 
accctcatgt 
atttattatt 
tacatcctca 
acaaattgcc 
cgttggctca 
accagcagca 
caggcagtca 
acagagcgcc 
cactgggaca 
ctcagatcca 
aacccacgca 
atggattaga 
tacctcaccc 
accccttgat 
ctggacgaat 
agcgaattca 
actcactgtt 
taggtgatga 
gtgacaccat 
tcatcgagct 
ggccccctgg 
ttgcaattac 
atctggatgt 
tgtacatttc 
atgaggttcg 
ttgcatgatt 
tgaattgtcc 
tgtcagggaa 
gtgcttgttt 
gtgtctttct 
tggacaggtg 
atcgtgaagc 
gatgcattgg 
ggtcatttga 
ggaggacatg 
ggaaatgagc 
gttatttctt 



tgttcctctg 
tactttagag 
gaaagaacaa 
gtctcccttg 
ccgtcacaga 
gctggtcatt 
gatccacatg 
ccttgaggaa 
ctctagtggc 
agttggggct 
ggaaatggaa 
cacacgtgcg 
gaaacaccgt 
agcaatggac 
gctgaaacac 
acagagtatc 
aggtcgaaaa 
gtatggcacc 
ctctcaaaag 
ttggaatgag 
ggcttccaaa 
ccatcacgct 
cgccaaatac 
tcaccatgga 
actccatcgc 
gtttatttct 
acccctaatt 
catgggacca 
ggttgggctg 
tcctctcttc 
cttcccaggt 
gccttgatcc 
ctgtggccaa 
aaggccacac 
cgaagcaatt 
atctcacagc 
tggagccact 
tacagaagat 



cctgggcagt 
ggaaagccac 
atgcgacagc 
gcaacaaaag 
cccctgaacc 
caacagcaac 
aacaaactgc 
gcagaggaag 
aacagcacta 
gtgaaggtca 
tctggggagc 
ctctctgtgc 
ctcgtctcca 
cgccccctcc 
cagtgcgttt 
tggtcacgac 
gccagcctgg 
aaccccctgg 
tttttttcct 
ctacactcgt 
gtggcctcag 
gaagaatcca 
ttgagagacc 
aacggtaccc 
tatgatgaag 
ttagagcccc 
ttcttgtcct 
agaaccagtg 
atttgatgtg 
ttgctttctt 
tggaacaggc 
tcccatggga 
agagtttgat 
ccctcctcta 
gatgacattg 
catctgtgat 
tgcagaagat 
cattgaaatt 



atggaggcag 
ccaacagcag 
aaaagcttct 
agagaatttc 
gaacccagtc 
accagcaatt 
tttcgaaatc 
agcttcaggg 
ggagcgacag 
aggaggaacc 
aggctgcttt 
gccaagctcc 
ggactcactc 
agcctggctc 
gtggcaattc 
tgcaagaaac 
aggaaataca 
acggacagaa 
cattaccttg 
ccggtgctgc 
gagagctgaa 
cagccatggg 
aactaaatat 
agcaggcctt 
ggaacttttt 
acttttattt 
ttgctggtgt 
cagaacaagfc 
ttgtttgatg 
ccatttgctc 
cttggagaag 
gatgttgagt 
ccagacatgg 
ggagggtaca 
gctgatggac 
gcatcagaag 
attctccacc 
caaagtatgt 



catcccggca 
ccaccaggct 
tgtagctggt 
acctggcatt 
tgcacctttg 
cttggagaag 
tattgaacaa 
ggaccaggcg 
cagtgcttgt 
agtggacagt 
tatgcaacag 
gctggctgcg 
ttcccctgct 
tgcaactgga 
caccacccac 

tgggctgcta 

gcttgttcat 
gctggacccc 
tggtggactt 
acgcatggct 
gaatgggttt 
gttctgcttt 
aagcaagata 
ttatgctgac 
ccctggcagt 
gtatctttca 
tttaaattac 
gcataaccca 
tttatttcaa 
tcttctctgc 
ggtacaatat 
accttgaagc 
tcttagtatc 
aagtgacggc 
gtgtggtgtt 
cctgtgtaaa 
aaagcccgaa 
ctttaaagtt 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3367 



<210> 8 

<211> 835 

<212> PRT 

<213> Homo sapiens 



<400> 8 
Met His Ser 
1 

Gly Leu Glu 

Met Met Pro 
35 

Glu Leu Leu 
50 

He Ala Glu 
65 

Ala Gin Leu 

Gin Glu Leu 

Gin Glu Val 
115 



Met He 
5 

Pro He 
20 

Val Val 

Leu He 

Phe Gin 

Gin Glu 

85 
Leu Glu 
100 

Glu Arg 



Ser Ser Val 

Ser Pro Leu 

Asp Pro Val 
40 

Gin Gin Gin 
55 

Lys Gin His 
70 

His He Lys 

Lys Glu Gin 

His Arg Arg 
120 



Asp Val 
10 

Asp Leu 
25 

Val Arg 

Gin Gin 

Glu Asn 

Glu Leu 

90 
Lys Leu 
105 

Glu Gin 



Lys Ser Glu Val 
Arg Thr 
Glu Lys 



He Gin 
60 

Leu Thr 
75 

Leu Ala 
Glu Gin 
Gin Leu 



Asp Leu 

30 
Gin Leu 
45 

Lys Gin 



Arg Gin 

He Lys 

Gin Arg 
110 
Pro Pro 
125 



Pro Val 
15 

Arg Met 

Gin Gin 

Leu Leu 

His Gin 

80 
Gin Gin 
95 

Gin Glu 
Leu Arg 
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Glv 


Lys 


Asp 


Arg Gly Arg 




130 






Gin 


Lys 


Leu 


Gin Glu Phe 


145 






150 


Pro 


Thr 


Asn 


Glv Lvs Asn 








165 


Tvr 


Thr 


Ala 


Ala His His 

"AH i^-kO XIAO 








180 


Ser 


Glv 


Thr 


Ser Pro Ser 






195 




Ala 


Lvs 


Asp 


Asp Phe Pro 




210 






Ser 


Ser 


Pro 


Glv Ser Glv 


225 






230 


Ser 


Val 


Thr 


gi ii Asn gi ii 

VJ-L. U nail U-LU 








245 


Glu 


Gin 


Met 


Val Ser- Gin 








260 


Asn 


LeU 


Leu 


Sot* TiPii TH/r* 
uci jjcu iyx 










Gly 


Tion 
ucu 


Dv*n 
XT J. VJ 


iiiu val rxu 




ci y \j 






Gl ii 


Lys 






305 








Pico 




gi n 


iyi \jj~y Kyxy 










Val 


X 111 


JJCU 


Gin Gl T."\/^c! 
vjiu v3 J.y uy o 










Glh 


His 


Leu 


T.pn T.pn T^'VT'c* 

JJCU J-JC-U. JJj O 






J J J 




Ala 


Gly 


Gly 


vd-L iri vj Ucu 












lie 


Ser 


Pro Glv Tie 


385 






390 


Pro 


Leu 


Asn 


A Y~rf Tlrr Gl ti 
nxy J. j. ix. uiix 










Gin 


Leu 


Val 


Tie Gl n Gl n 

lie VJ IU U1U 










Gl rt 


j-yr 


V7111 


Gl n ml -n Tlo 

Vjlll OJ.ll llC 










Glu 


Gl n 


Leu 


uy o oiii nu 




450 






Leu 


Gl n 

VJ1 J.1 


gi v 

vjiy 


Sen fill ri Al a 
rtSjJ vjiii nia 


465 






470 


Asn 


Ser 


Thr 


At~ct Set" Asn 








485 


Gin 


Val 


Glv 


Ala Val Lvs 








500 


Asp 


Ala 


Gin 


Tie Gin Glu 






515 




Gin 


Gin 


Pro 


Phe Leu Glu 




530 






Gin 


Ala 


Pro 


Leu Ala Ala 


545 






550 


Leu 


Val 


Ser 


Arg Thr His 








565 


Pro 


Ala 


Met 


Asp Arg Pro 








580 


Tyr 


Asp 


Pro 


Leu Met Leu 






595 




Thr 


His 


Pro 


Glu His Ala 



610 
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Glu 


Arg 


Ala 


Val 


Ala 


Ser 


135 










140 


Leu 


Leu 


Ser 


Lys 


Ser 


Ala 










155 




His 


Ser 


Val 


Ser 


Arg 


His 








170 






Thr 


Ser 


Leu Asp 


Gin 


Ser 






185 








Tyr Lys 


Tyr Thr 


Leu 


Pro 




200 










Leu Arg 


Lys 


Tnr 


Glu 


Ser 


215 










220 


Pro 


Ser 


Ser 


Pro 


Asn 


Asn 










235 




Thr 


Ser 


vai 


Leu 


Pro 


Pro 








OCA 






Gin Arg 


lie 


Leu 


lie 


His 






265 








Thr 


Ser 


Pro 


Ser 


Leu 


Pro 




280 










Ser 


Gin 


Leu 


Asn 


Ala 


Ser 


295 










inn 


Thr 


Gin 


Thr 


Leu 


Arg 


bill 










ju 




Ser 


lie 


Pro 


Ala 


Cor 


Coy* 
Del 








330 






Pro 


Pro 


Asn 


Ser 


Ser 


ni is 






345 








Glu 


Gin 


Met 


Arg 


Gin 


Gin 




360 










His 


Pro 


Gin 


Ser 


Pro 


Leu 


375 










380 


Arg 


Gly 


Thr 


His 


Lys 


Leu 










395 




Ser 


Ala 


Pro 


Leu 


Pro 


Gin 








410 






Gin 


His 


Gin 


Gin 


Phe 


Leu 






425 








His 


Met 


Asn Lys 


Leu 


Leu 




440 










Gly Ser 


His 


Leu 


Glu 


Glu 


455 










460 


Met 


Gin 


Glu Asp 


Arg Ala 










475 




Ser 


Ser 


Ala Cys 


Val 


Asp 








490 






Val 


Lys 


Glu Glu 


Pro Val 






505 








Met 


Glu 


Ser Gly 


Glu 


Gin 




520 










Pro 


Thr 


His 


Thr 


Arg Ala 


535 










540 


Val 


Gly 


Met Asp 


Gly Leu 










555 




Ser 


Ser 


Pro Ala 


Ala 


Ser 








570 






Leu 


Gin 


Pro Gly 


Ser Ala 






585 








Lys 


His 


Gin Cys 


Val 


Cys 




600 










Gly Arg 


lie 


Gin 


Ser 


lie 


615 










620 



PCT/US02/19051 



Thr 


Glu 


Vai 


Lys 


Thr Lys 


Asp 


Thr 








160 


Pro Lys 


Leu 


Trp 






175 




Ser 


Pro 


Pro 


Leu 




190 






Gly Ala 


Gin Asp 


205 








Ser Val 


Ser 


Ser 


Gly Pro 


Thr Gly 








*"> A f\ 

<s4U 


Thr 


Pro 


His 


Ala 






255 




Glu Asp 


Ser 


Met 




270 






Asn- lie 


Thr 


Leu 


285 








Asn 


Ser 


Leu 


Lys 




vai 


Pro 


Leu 








320 


Ser 


TT -J — 

HIS 


Pro 


His 






335 






Alp 
Aid 


Leu 


Leu 




j jU 






T 

Lys 


Leu 


Leu 


Val 


365 








Ala 


Thr 


Lys 


Glu 


Pro 


Arg 


His 


Arg 








400 


Ser 


Thr 


Leu 


Ala 






415 




Glu 


Lys 


Gin 


Lys 




430 






Ser 


Lys 


Ser 


He 


445 








Ala 


Glu 


Glu 


Glu 


Pro 


Ser 


Ser Gly 








480 


Asp 


Thr 


Leu Gly 






495 




Asp 


Ser 


Asp Glu 




510 






Ala 


Ala 


Phe 


Met 


525 








Leu 


Ser 


Val Arg 


Glu 


Lys 


His 


Arg 








560 


Val 


Leu 


Pro 


His 






575 




Thr 


Gly 


lie Ala 




590 






Gly 


Asn 


Ser 


Thr 


605 








Trp 


Ser 


Arg 


Leu 
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Gin 




X 111- 


\3±y x*eu j_>eu 


Asn Lys 


625 








Ala 


Ser 


Leu 


vaxu (3d. LL 11c 


uxn x»eu 








Of±J 




Leu 






x Ilx. ASu JrlvO 


ijeu Asp 








vj O \J 




Leu 


AJCU 




Asp /isp ber 


Gin Lys 






67R 




/TO A 


Gly 






vax f\sp ber 


Asp Thr 










fTQC 

by j 


Gly 


At a 
Ala 




Arg jyiec ax a 


vax oxy 


705 






Tin 
/ xu 


V CI -L 




ser 


bxy {jJlu Leu 


Lys Asn 








/ZD 




m T7- 

\j JLy 


JtlXS 


HXS 


Ala Ca±u Glu 


Ser Thr 








/ 40 






Vdl 


ax a 


xxe Tnr Ala 


Lys Tyr 






755 




760 


Lys 


lie 


Leu 


He Val Asp 


Leu Asp 




770 






775 


Gin 


Ala 


Phe 


Tyr Ala Asp 


Pro Ser 


785 






790 




Tyr 


Asp 


Glu 


Gly Asn Phe 


Phe Pro 








805 




Arg 


Phe 


He 


Ser Leu Glu 


Pro His 








820 




Cys 


He 


Ala 










835 







Cys Glu 


Arg 


He Gin 


Gly Arg Lys 




C ~i C 

635 




640 


Val Hxs 


Ser 


Glu His 


His Ser Leu 


C C A 

650 






655 


Gly Gin 


Lys 


Leu Asp 


Pro Arg He 


boo 






670 


Phe Phe 


Ser 


Ser Leu 


Pro Cys Gly 






685 


He Trp 


Asn 


Glu Leu 


His Ser Ser 






700 




Cys Val 


lie 


Glu Leu 


Ala Ser Lys 




715 




720 


Gly Phe 


Ala 


Val Val 


Arg Pro Pro 


730 






735 


Ala Met 


Gly 


Phe Cys 


Phe Phe Asn 


745 






750 


Leu Arg 


Asp 


Gin Leu 


Asn He Ser 






765 




Val His 


His 


Gly Asn 


Gly Thr Gin 






780 




He Leu 


Tyr 


He Ser 


Leu His Arg 




795 




800 


Gly Ser 


Gly Ala Pro 


Asn Glu Val 


810 






815 


Phe Tyr 


Leu 


Tyr Leu 


Ser Gly Asn 


825 






830 



<210> 9 

<211> 1791 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ggggaagaga ggcacagaca cagataggag 
tgagggtttt tgcaacaaaa ccctagcagc 
ggacgagagc agctcttggc tcagcaaaga 
aagtcagaag ttcctgtggg cctggagccc 
aggatgatga tgcccgtggt ggaccctgtt 
cttcttatcc agcagcagca acaaatccag 
cagcatgaga acttgacacg gcagcaccag 
ctagccataa aacagcaaca agaactccta 
caagaacagg aagtagagag gcatcgcaga 
gatagaggac gagaaagggc agtggcaagt 
ctactgagta aatcagcaac gaaagacact 
cgccatccca agctctggta cacggctgcc 
ccccttagtg gaacatctcc atcctacaag 
gatgatttcc cccttcgaaa aactgaatcc 
cccagttcac caaacaatgg gccaactgga 
ccccctaccc ctcatgccga gcaaatggtt 
tccatgaacc tgctaagtct ttatacctct 
cccgcagtgc catcccagct caatgcttcg 
acgcagacgc ttaggcaagg tgttcctctg 
tcttccagcc accctcatgt tactttagag 
ctcctgcagc atttattatt gaaagaacaa 
ggagttccct tacatcctca gtctcccttg 
agaggtaccc acaaattgcc ccgtcacaga 
cctcagagca cgttggctca gctggtcatt 
cagaagcaat accagcagca gatccacatg 
ctgaagcaac caggcagtca ccttgaggaa 



aagggcaccg gctggagcca cttgcaggac 60 
ctgaagaact ctaagccaga tggggtggct 120 
atgcacagta tgatcagctc agtggatgtg 180 
atctcacctt tagacctaag gacagacctc 240 
gtccgtgaga agcaattgca gcaggaatta 300 
aagcagcttc tgatagcaga gtttcagaaa 360 
gctcagcttc aggagcatat caaggaactt 420 
gaaaaggagc agaaactgga gcagcagagg 480 
gaacagcagc ttcctcctct cagaggcaaa 540 
acagaagtaa agcagaagct tcaagagttc 600 
ccaactaatg gaaaaaatca ttccgtgagc 660 
caccacacat cattggatca aagctctcca 720 
tacacattac caggagcaca agatgcaaag 780 
tcagtcagta gcagttctcc aggctctggt 840 
agtgttactg aaaatgagac ttcggttttg 900 
tcacagcaac gcattctaat tcatgaagat 960 
ccttctttgc ccaacattac cttggggctt 1020 
aattcactca aagaaaagca gaagtgtgag 1080 
cctgggcagt atggaggcag catcccggca 1140 
ggaaagccac ccaacagcag ccaccaggct 1200 
atgcgacagc aaaagcttct tgtagctggt 1260 
gcaacaaaag agagaatttc acctggcatt 1320 
cccctgaacc gaacccagtc tgcacctttg 1380 
caacagcaac accagcaatt cttggagaag 1440 
aacaaactgc tttcgaaatc tattgaacaa 1500 
gcagaggaag agcttcaggg ggaccaggcg 1560 
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atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
gtaataggca aagatttagc tccaggattt gtaattaaag tcattatctg a 1791 

<210> 10 
<211> 546 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met His Ser Met lie Ser Ser Val Asp Val Lys Ser Glu Val Pro Val 

15 10 15 

Gly Leu Glu Pro lie Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 

20 25 30 

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gin Leu Gin Gin 

35 40 45 

Glu Leu Leu Leu lie Gin Gin Gin Gin Gin lie Gin Lys Gin Leu Leu 

50 55 60 

lie Ala Glu Phe Gin Lys Gin His Glu Asn Leu Thr Arg Gin His Gin 
65 70 75 80 

Ala Gin Leu Gin Glu His He Lys Glu Leu Leu Ala He Lys Gin Gin 

85 90 95 

Gin Glu Leu Leu Glu Lys Glu Gin Lys Leu Glu Gin Gin Arg Gin Glu 

100 105 110 

Gin Glu Val Glu Arg His Arg Arg Glu Gin Gin Leu Pro Pro Leu Arg 

115 120 125 

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys 

130 135 140 

Gin Lys Leu Gin Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr 
145 150 155 160 

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp 

165 170 175 

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gin Ser Ser Pro Pro Leu 

180 185 190 

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gin Asp 

195 200 205 

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser 

210 215 220 

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly 
225 230 235 240 

Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala 

245 250 255 

Glu Gin Met Val Ser Gin Gin Arg He Leu He His Glu Asp Ser Met 

260 265 270 

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn He Thr Leu 

275 280 285 

Gly Leu Pro Ala Val Pro Ser Gin Leu Asn Ala Ser Asn Ser Leu Lys 

290 295 300 

Glu Lys Gin Lys Cys Glu Thr Gin Thr Leu Arg Gin Gly Val Pro Leu 
305 310 315 320 

Pro Gly Gin Tyr Gly Gly Ser He Pro Ala Ser Ser Ser His Pro His 

325 330 335 

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gin Ala Leu Leu 

340 345 350 

Gin His Leu Leu Leu Lys Glu Gin Met Arg Gin Gin Lys Leu Leu Val 

355 360 365 

Ala Gly Gly Val Pro Leu His Pro Gin Ser Pro Leu Ala Thr Lys Glu 

370 375 380 

Arg He Ser Pro Gly He Arg Gly Thr His Lys Leu Pro Arg His Arg 
385 390 395 400 

Pro Leu Asn Arg Thr Gin Ser Ala Pro Leu Pro Gin Ser Thr Leu Ala 
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Gin Leu val lie Gil Gin Gin His Gin £J Phe Leu Glu Lys Gin Lys 

425 a *^ n 

Gin Tyr Gin Gin Gin He His Met Asn Lys Leu Leu Ser Lys Ser lie 

4J:> 440 445 

Glu Gin Leu Lys Gin Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu 

455 46Q 
Leu Gin Gly Asp Gin Ala Met Gin Glu Asp Arg Ala Pro Ser Ser Gly 

470 475 A * 

Asn Ser Thr Arg Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly 

485 490 
Gin Val Gly Ala Val Lys Val Lys Glu Glu Pro Val Asp Ser Isp Glu 

Asp Ala Gin lie Gin Glu Met Glu Ser Gly Glu Gin Ala Ala Phe Met 



515 520 5,5 

Gin Gin Val lie Gly Lys Asp Leu Ala Pro Gly Phe Val He Lys Val 

He lie 540 
545 



<210> 11 
<211> 590 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Met His Ser Met lie Ser Ser Val Asp Val Lys Ser Glu Val Pro Val 
Gly Leu Glu Pro He Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 



Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gin Leu Gin Gin 
" 40 k 



Glu Leu Leu Leu He Gin Gin gLi Gin Gin He Gin Lys Gin Leu Leu 



55 60 



He Ala Glu Phe Gin Lys Gin His Glu Asn Leu Thr Arg Gin His Gin 



70 75 80 



Ala Gin Leu Gin Glu His He Lys Glu Leu Leu Ala He Lys Gin Gin 

Gin Glu Leu Leu Glu Lys Glu Gin Lys Leu Glu Gin Gin Arg Gin Glu 

Gin Glu Val Glu Arg His Arg Arg gSu Gin Gin Leu Pro Pro Leu Arg 

- LXb 120 195 

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys 

135 140 
Gin Lys Leu Gin Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr 

155 i (z r\ 

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp 

170 1 ntz 

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gin Ser Ser Pro Pro Leu 



160 



165 



180 185 190 

Ala 

Ala Lys Asp Asp Phe Pro Leu Lys Thr Ala Ser Glu" Pro Asn Leu 



Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly A ia Gin Asp 

200 205 



215 220 

225 **" *** £S LYS GlD LyS Val Ma Glu ^g Arg Ser Ser 

235 o /i a 

Pro Leu Leu Arg Arg Lys Asp Gly Asn Val Val Thr Ser Phe Lys Lys 

^45 250 orr 

Arg Met Phe Glu Val Thr Glu Ser Ser Val Ser Ser Ser Ser Pro Gly 

o _ 265 270 

Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly Ser Val Thr Glu 
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280 


Asn 




iiix 


oer 


Val 


Leu 


Pro 


Pro 




Azf\J 










295 




OgX 


LrXXl 


pi >> 


Arg 


He 


Leu 


lie 


His 


? OR 

J v J 










310 






Leu 


ryr 


1I1X 


oer 


Pro 


Ser 


Leu 


Pro 










325 








VAX 


xrx u 


Cor 

OCX 


Pi n 

VjXIX 


Leu 


Asn 


Ala 


Ser 








IAD 












vjlU 


nTU >- 
Iflx 




Thr Leu Arg Gin 






TCC 
JJJ 










360 


biy 


p 1 , » 

biy 


Ser 


lie 


Pro 


Ala 


Ser 


Ser 




j / u 










375 






Lys 


Pro 


Pro 


Asn 


Ser 


Ser 


His 


JOJ 










390 






Leu. 


Lys 


bill 


vjin 


Met 


Arg 


Gin 


Gin 










405 








Pro 


Leu 




Pro 


Gin 


Ser 


Pro 


Leu 


















vjxy 


11G 


Arg 


pi,, 

biy 


Thr 


His 


Lys 


Leu 






/i o cr 
4Jb 










440 


Thr 


Gin 


Ser 


Ala 


Pro 


Leu 


Pro 


Gin 




450 










455 




bin 


Gin 


Gin 


His 


Gin 


Gin 


Phe 


Leu 


4bb 










470 






bin 


lie 


HIS 


Met. 


Asn 


Lys 


Leu 


Leu 










485 








Gin 


Pro 


Gly 


Ser 


His 


Leu 


Glu 


Glu 








500 










Gln 


Ala 


Met 


Gin 


Glu Asp Arg Ala 






515 










520 


Ser 


Asp 


Ser 


Ser 


Ala 


Cys 


Val 


Asp 




530 










535 




Val 


Lys 


Val 


Lys 


Glu 


Glu 


Pro 


Val 


545 










550 






Gin 


Glu 


Met 


Glu 


Ser Gly Glu Gin 










565 








Gly 


Lys 


Asp 


Leu 


Ala 


Pro Gly Phe 








580 











<210> 12 
<211> 1084 
<212> PRT 

<213> Homo sapiens 



<400> 12 



Met 


Ser 


Ser 


Gin 


Ser His Pro Asp 


1 








5 


Val 


Glu 


Leu 


Leu 


Asn Pro Ala Arg 








20 




Asp 


Val 


Ala 


Thr 


Ala Leu Pro Leu 






35 




40 


Met 


Asp 


Leu 


Arg 


Leu Asp His Gin 




50 






55 


Ala 


Leu 


Arg 


Glu 


Gin Gin Leu Gin 


65 








70 


Lys 


Gin 


Gin 


He 


Gin Arg Gin He 










85 


His 


Glu 


Gin 


Leu 


Ser Arg Gin His 








100 




Lys 


Gin 


Gin 


Gin 


Glu Met Leu Ala 
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285 



XI IX 


Pro 


xlxS 


7v 1 ^ pi n 
Ala bill 


vain 


Mot- 
net 


Vol 








inn 

jUU 








(21 n 
vjlu 


Asp 


Ser 


Met Asn 


Leu 


Leu 


Ser 






J ID 








j z u 


Asn 


lie 


Thr 


Leu Gly 


Leu 


Pro 


Aia 












JJJ 




Asn 


Ser 


Leu 


_ 

Lys Glu 


Lys 


pi — , 

bin 


Lys 










J Jw 






Pi \r 

vjxy 


vai 


Pro 


x^eu fro 


pi 17- 

biy 


pi -n 

bin 


Tyr 








JOJ 








Ser 


nlS 


Pro 


his vai 


Thr 


Leu 


blU 








J ou 








pi t* 

vain 


Aia 


Leu 


Leu Gin 


TT 4 r~t 
HIS 


Leu 


Leu 














4 U U 


Lys 


Leu 


Leu 


TT- ~\ TV 1 — 

vai Aia 


biy 


biy 


Val 




a 1 n 
410 








/tic 
41b 




Ala 


Thr 


Lys 


Glu Arg 


He 


Ser 


Pro 


/lot; 
4zb 








a i r\ 






Pro 


Arg 


His 


Arg Pro 


Leu 


Asn 


Arg 








A A C 








Ser 


Thr 


Leu 


Ala Gin 


Leu 


Val 


He 








460 








Glu 


Lys 


Gin 


Lys Gin 


Tyr 


Gin 


Gin 






475 








480 


Ser 


Lys 


Ser 


He Glu 


Gin 


Leu 


Lys 




490 








495 




Ala 


Glu 


Glu 


Glu Leu 


Gin 


Gly 


Asp 


505 








510 






Pro 


Ser 


Ser 


Gly Asn 


Ser 


Thr 


Arg 








525 








Asp 


Thr 


Leu 


Gly Gin 


Val 


Gly 


Ala 








540 








Asp 


Ser 


Asp 


Glu Asp 


Ala 


Gin 


He 






ccc 
JJJ 








bou 


Ala 


TV T _ 

Ala 


Pne 


Met Gin 


Gin 


Val 


lie 




b / U 








b /b 




Val 


lie 


Lys 


Val He 


lie 






585 








C Q A 

by u 






ijxy 


Leu 


OCX 


pi tr TV -y~rr 
vaiy AX y 


Asp 


pi n 

OlXl 


Pro 




10 








15 




Val 


Asn 


His 


Met Pro 


Ser 


Thr 


Val 


25 








30 






Gin 


Val 


Ala 


Pro Ser 


Ala 


Val 


Pro 








45 








Phe 


Ser 


Leu 


Pro Val 


Ala 


Glu 


Pro 








60 








Gin 


Glu 


Leu 


Leu Ala 


Leu 


Lys 


Gin 






75 








80 


Leu 


He 


Ala 


Glu Phe 


Gin 


Arg 


Gin 




90 








95 




Glu 


Ala 


Gin 


Leu His 


Glu 


His 


He 


105 








110 






Met 


Lys 


His 


Gin Gin 


Glu 


Leu 


Leu 
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115 120 12 5 

Glu His Gin Arg Lys Leu Glu Arg His Arg Gln Glu Gin Glu Leu Glu 

13 0 135 140 

Lys Gin His Arg Glu Gin Lys Leu Gin Gin Leu Lys Asn Lys Glu Lys 

15° 155 160 

Gly Lys Glu Ser Ala Val Ala Ser Thr Glu Val Lys Met Lys Leu Gin 

165 170 " 175 

Glu Phe. Val Leu Asn Lys Lys Lys Ala Leu Ala His Arg Asn Leu Asn 

180 185 ig 0 

His Cys lie Ser Ser Asp Pro Arg Tyr Trp Tyr Gly Lys Thr Gin His 
l g 5 200 205 

SSr o?. L6U Asp Gln Ser Ser Pro Pro Gln Se ^ Gly Val Ser Thr Ser 

210 215 220 

Tyr Asn His Pro Val Leu Gly Met Tyr Asp Ala Lys Asp Asp Phe Pro 
r , 230 235 240 

Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu Lys Leu Arg Ser Arg Leu 

245 250 ~ 255 

Lys Gln Lys Val Ala Glu Arg Arg Ser Ser Pro Leu Leu Arg Arg Lys 

260 265 270 

Asp Gly Pro Val Val Thr Ala Leu Lys Lys Arg Pro Leu Asp Val Thr 
275 280 285 

ASP fan Ma CyS SSr Ser Ala Pr ° Gly Ser Pro Ser Ser Pro Asn 

, 290 295 300 

Asn Ser Ser Gly Ser Val Ser Ala Glu Asn Gly He Ala Pro Ala Val 

310 315 320 

Pro Ser He Pro Ala Glu Thr Ser Leu Ala His Arg Leu Val Ala Arq 

325 330 335 

Glu Gly Ser Ala Ala Pro Leu Pro Leu Tyr Thr Ser Pro Ser Leu Pro 

340 345 350 

Asn He Thr Leu Gly Leu Pro Ala Thr Gly Pro Ser Ala Gly Thr Ala 

355 36O 365 

Gly Gln Gln Asp Thr Glu Arg Leu Thr Leu Pro Ala Leu Gln Gln Arg 

370 375 380 

Leu Ser Leu Phe Pro Gly Thr His Leu Thr Pro Tyr Leu Ser Thr Ser 

390 395 4Q0 

Pro Leu Glu 'Arg Asp Gly Gly Ala Ala His Ser Pro Leu Leu Gln His 

405 410 415 

Met Val Leu Leu Glu Gln Pro Pro Ala Gln Ala Pro Leu Val Thr Glv 
420 425 43 0 Y 

Leu Gly Ala Leu Pro Leu His Ala Gln Ser Leu Val Gly Ala Asp Arg 
435 440 445 y u 

Val Ser Pro Ser He His Lys Leu Arg Gln His Arg Pro Leu Gly Arg 

450 455 ^gg 

Thr Gln Ser Ala Pro Leu Pro Gln Asn Ala Gln Ala Leu Gln His Leu 

470 475 48Q 

Val He Gln Gln Gln His Gln Gln Phe Leu Glu Lys His Lys Gln Gln 

485 490 495 

Phe Gln Gln Gln Gln Leu Gln Met Asn Lys He He Pro Lys Pro Ser 

500 505 5i 0 

Glu Pro Ala Arg Gln Pro Glu Ser His Pro Glu Glu Thr Glu Glu Glu 

515 520 525 

Leu Arg Glu His Gln Ala Leu Leu Asp Glu Pro Tyr Leu Asp Arg Leu 
=30 535 540 

Pro Gly Gln Lys Glu Ala His Ala Gln Ala Gly Val Gln Val Lys Gln 

550 555 560 

Glu Pro He Glu Ser Asp Glu Glu Glu Ala Glu Pro Pro Arg Glu Val 
„. 565 570 ' 575 

Glu Pro Gly Gln Arg Gln Pro Ser Glu Gln Glu Leu Leu Phe Arg Gln 

580 585 590 

Gln Ala Leu Leu Leu Glu Gln Gln Arg He His Gln Leu Arg Asn Tyr 

595 600 605 

Gln Ala Ser Met Glu Ala Ala Gly He Pro Val Ser Phe Gly Gly His 
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610 






615 








620 








Arg 


Pro 


Leu 


Ser 


Arg Ala Gin 


Ser 


Ser Pro 


Ala 


Ser 


Ala 


Thr 


Phe Pro 


625 








630 






635 








640 


Val 


Ser 


Val 


Gin 


Glu Pro Pro 


Thr 


Lys Pro 


Arg 


Phe 


Thr 


Thr 


Gly Leu 










645 




650 










655 


Val 


Tyr 


Asp 


Thr 


Leu Met Leu 


Lys 


His Gin 


Cys 


Thr 


Cys 


Gly 


Ser Ser 








660 






665 








670 




Ser 


Ser 


His 


Pro 


Glu His Ala 


Gly 


Arg lie 


Gin 


Ser 


He 


Trp 


Ser Arg 






675 






680 








685 






Leu 


Gin 


Glu 


Thr 


Gly Leu Arg 


Gly 


Lys Cys 


Glu 


Cys 


lie 


Arg 


Gly Arg 




690 






695 








700 








Lys 


Ala 


Thr 


Leu 


Glu Glu Leu 


Gin 


Thr Val 


His 


Ser 


Glu 


Ala 


His Thr 


705 








710 






715 








720 


Leu 


Leu 


Tyr 


Gly 


Thr Asn Pro 


Leu 


Asn Arg 


Gin 


Lys 


Leu 


Asp 


Ser Lys 










725 




730 










735 


Lys 


Leu 


Leu 


Gly 


Ser Leu Ala 


Ser 


Val Phe 


Val 


Arg 


Leu 


Pro 


Cys Gly 








740 






745 








750 




Gly 


Val 


Gly 

J; 


Val 


Asp Ser Asp 


Thr 


He Trp 


Asn 


Glu 


Val 


His 


Ser Ala 






755 






760 








765 






Gly 


Ala 


Ala 


Arg 


Leu Ala Val 


Gly 


Cys Val 


Val 


Glu 


Leu 


Val 


Phe Lys 




770 






775 








780 








Val 


Ala 


Thr 


Glv 


Glu Leu Lys 


Asn 


Gly Phe 


Ala 


Val 


Val 


Arg 


Pro Pro 


785 








790 






795 








800 


Gly 


His 


His 


Ala 


Glu Glu Ser 


Thr 


Pro Met 


Glv 


Phe 


Cvs 


Tvr 


Phe Asn 










805 




810 










815 


Ser 


Val 


Ala 


Val 


Ala Ala Lys 


Leu 


Leu Gin 


Gin 


Ara 


Leu 


Ser 


Val Ser 








820 






825 








830 




Lys 


lie 


Leu 


He 


Val Asp Trp 


Asd 


Val His 


His 


Gly 


Asn 


Glv 


Thr Gin 






835 






840 








845 






Gin 


Ala 


Phe 


Tyr 


Ser Asp Pro 


Ser 


Val Leu 


Tyr 


Met 


Ser 


Leu 


His Arg 




850 






855 








860 








Tyr 


Asp 


Asp 


Gly 


Asn Phe Phe 


Pro 


Gly Ser 


Gly 


Ala 


Pro 


Asp 


Glu Val 


865 








870 






875 








880 


Gly 


Thr 


Gly 


Pro 


Gly Val Gly 


Phe 


Asn Val 


Asn 


Met 


Ala 


Phe 


Thr Gly 










885 




890 










895 


Gly 


Leu 


Asp 


Pro 


Pro Met Gly 


Asp 


Ala Glu 


Tyr 


Leu 


Ala 


Ala 


Phe Arg 








900 






905 








910 




Thr 


Val 


Val 


Met 


Pro He Ala 


Ser 


Glu Phe 


Ala 


Pro 


Asp 


Val 


Val Leu 






915 






920 








925 






Val 


Ser 


Ser 


Gly 


Phe Asp Ala 


Val 


Glu Gly 


His 


Pro 


Thr 


Pro 


Leu Gly 




930 






935 








940 








Gly 


Tyr 


Asn 


Leu 


Ser Ala Arg 


Cys 


Phe Gly 


Tyr 


Leu 


Thr 


Lys 


Gin Leu 


945 








950 






955 








960 


Met. 


Gly 


Leu 


Ala 


Gly Gly Arg 


He 


Val Leu 


Ala 


Leu 


Glu 


Gly 


Gly His 










965 




970 










975 


Asp 


Leu 


Thr 


Ala 


He Cys Asp 


Ala 


Ser Glu 


Ala 


Cys 


Val 


Ser 


Ala Leu 








980 






985 








990 




Leii 


Gly 


Asn 


Glu 


Leu Asp Pro 


Leu 


Pro Glu 


Lys 


Val 


Leu 


Gin 


Gin Arg 






995 






1000 






1005 




Pro 


Asn 


Ala 


Asn 


Ala Val Arg 


Ser 


Met Glu 


Lys 


Val 


Met 


Glu 


He His 




1010 




1015 






1020 






Ser 


Lys 


Tyr 


Trp 


Arg Cys Leu 


Gin 


Arg Thr 


Thr 


Ser 


Thr 


Ala 


Gly Arg 


1025 






1030 






1035 






1040 


Ser 


Leu 


He 


Glu 


Ala Gin Thr Cys Glu Asn 


Glu 


Glu 


Ala 


Glu 


Thr Val 










1045 




1050 








1055 


Thr 


Ala 


Met 


Ala 


Ser Leu Ser Val Gly Val Lys 


Pro 


Ala 


Glu 


Lys Arg 








1060 




1065 








1070 


Pro Asp Glu 


Glu 


Pro Met Glu 


Glu 


Glu Pro 


Pro 


Leu 









1075 1080 



<210> 13 
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<211> 3550 

<212> DWA 

<213> Homo sapiens 

<400> 13 



SISSt?t taZtT ^ Cag f ta ^ aagggcaccg gctggagcca cttgcaggac 60 

=1 I E LSI LiSi SIF ~ a 
IliiEHSillli^Eii 

EE s I! §F « =i iiSi a 
« |= S= 1=1 1~ H= E 
i III ™ Eii S 2 =» =S is 

agccaccagg ctctcctaca acatt? a h^ Sttactttag agggaaagcc acccaacagc 1320 

ssss saS 5=i 55 =5 =- 

SS52S f ~ SSSS 2SSE IKS 

^t-f-**-~ g^gacgaaga tgctcagatc caggaaatgg aatctgggcra acaaactart 7fl*n 

IS ill 551 S3 IS sis 

ao«»S„„? ttttacctca cccagcaatg gaccgccccc tecagcctgg ctctgcaact 2100 
oSctgS «o«a™S <=— agtgcg tttgjggcl. ttcaloc.cc 2U0 

ctaSJSS " SC «3» C J aataoagagt atctggtcac gaotgcaaga aactgggctg 2220 
l""tg"c atXSS "»0«»9oc tggagg.alt ac.g«?gtl 22sS 

ssssss sees g Lii ™« ssss sssss ass 

F F S SSSS £352 =322 S2S 

S ii Eii ~ =ss sss 

ssi iiss III s ™ »s e 

ilEI ill I P «1 Si =1 is 

tat-p^ha^ SWtgtgtctt tctcttccca ggttggaaca ggccttggag aagggtacaa 3120 
l^tl gcctggacag gtggccttga tcctcccatg ggagatgttg agtfccttaa 3180 
atctgctggl tttaataS agCCtgtggC —agagtt? gltccagacl tggtcttagt UTo 
ggcaaaaSt tttlll^t ^ ggaaggcca cacccctcet ctaggagggt acaaagtgac 3300 
„?^ aa ^ gt ttfc 9fftcatt tgacgaagca attgatgaca ttggctgatg gacgtgtacrt 3360 
gttggctcta gaaggaggac atgatctcac agccatctgt gatgcatcag SStJtS 3«S 
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aaatgccctt ctaggaaatg agctggagcc acttgcagaa gatattctcc accaaagccc 3480 
gaatatgaat gctgttattt ctttacagaa gatcattgaa attcaaagta tgtctttaaa 3540 
gttctcttaa 3550 

<210> 14 

<211> 7699 . 

<212> DNA 

<213> Homo sapiens 

<400> 14 

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240 
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300 
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360 
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420 
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480 
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600 
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660 
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720 
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780 
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840 
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900 
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960 
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020 
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080 
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140 
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320 
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380 
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440 
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500 
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560 
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620 
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680 
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740 
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800 
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860 
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920 
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980 
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100 
tactttagag ggaaagccac ccaacagcag ccaccaggct. ctcctgcagc atttattatt 2160 
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220 
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280 
ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340 
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400 
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460 
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520 
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580 
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640 
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700 
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760 
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820 
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880 
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940 
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000 
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060 
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120 
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ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180 
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
^« C ?° gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
«™ tatgatgaag BBaacttttt ccctggcagt ggagccccaa atgaggttgg 3540 

aacaggcctt ggagaagggt acaatataaa tattgcctgg acaggtggcc ttgatcctcc 3600 
catgggagat gttgagtacc ttgaagcatt caggaccatc gtgaagcctg tggccaaaga 3660 
?"^ 9 f tCCa 9 acafc 99t<:t tagtatctgc tggatttgat gcattggaag gccacacccc 3720 
™^;^ agg f gggtacaaag tgacggcaaa atgttttggt catttgacga agcaattgat 3780 
gacattggct gatggacgtg tggtgttggc tctagaagga ggacatgatc tcacagccat 3840 
=„ 9 = ga ^ a t ^ agaagcct gtgtaaatgc ccttctagga aatgagctgg agccacttgc 3900 
?™*ltt c tccaccaaa gcccgaatat gaatgctgtt atttctttac agaagatcat 3960 
tgaaattcaa agtatgtctt taaagttctc tggatccggt accagattac aaggacgacg 4020 
™l? « Caagta Sratcccgggt ggcatccctg tgacccctcc ccagtgcctc tcctggcctt 4080 
ggaagttgcc actccagtgc cefaccagcct tgtcctaata aaattaagtt gcatcatttt 4140 
^„^f^ ag gtgtcctcta taatattatg gggtggaggg gggtggtatg gagcaagggg 4200 
^tt 9 " gaagacaacc tgtagggcct gcggggtcta ttcgggaacc aagctggagt 4260 
gcagtggcac aatcttggct cactgcaatc tccgcctcct gggttcaagc gattctcctg 4320 
^???f Ct ° ccgagttgtt Srggattccag gcatgcatga ccaggctcag ctaatttttg 4380 
ntllt f 9 t agagacgggg tttcaccata ttggccaggc tggtctccaa ctcctaatct 4440 
? acc caccttg gcctcccaaa ttgctgggat tacaggcgtg aaccactgct 4500 
aT,tr,T g ^ cttc ^ at t"aaaataa ctataccagc aggaggacgt ccagacacag 4560 
cataggctac ctgccatggc ccaaccggtg ggacatttga gttgcttgct tggcactgtc 4620 
"? ggt ^ aC tca 9tagatg cctgttgaat tgggtacgcg gccagcttct 4680 
gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccagcag gcagaagtat 4740 
gcaaagcatg catctcaatt agtcagcaac caggtgtgga aaagtccccl ggc?ccccag 4800 
^ gg ^ agaag tat ^ caaa ^ atgcatctca attagtcagc aaccatagtc ccgcccctaa 4860 
taatthtr^ h^hT actccgccca STttccgccca ttctccgccc catggctgac 4920 
taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta ttccagaagt 4980 
agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcctcgag gaactgaaaa 5040 
accagaaagt taattcccta tagtgagtcg tattaaattc gtaatcaSgg ?cata?c?g? 5100 
:Z^: g 3 g aaatt 9 tta t ccgctcacaa ttccacacaa catacgagcc ggaagcataa 5160 
far™ 3 ^ Ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 5220 
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 5280 
cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 5340 
ccSSn * cggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 5400 
™=f ^ t a 99^ a taac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 5460 
f?r a ™ aa aaaggccgcg "Srctggcgt ttttccatag gctccgcccc cctgacgagc 5520 
aaar« aa ^ a tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 5580 
S" C ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 5640 
nnl^ 9 cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc tcacgctgta 5700 
* ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 5760 
ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 5820 
alaatnltlZ gcca ^ gg <: a gcagccactg gtaacaggat tagcagagcg aggtatgtag 5880 
gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat 5940 
nllll Q cgctctgct 9 aagccagtta ccttcggaaa aagagttggt agctcttgat 6000 
™f aaaCa aaccaccgct 39tagcggtg gtttttttgt ttgcaagcag cagattacgc 6060 
aalllllllt aggatc ^ aa gaagatcctt tgatcttttc tacggggtct gacgotcagt 6120 
f gaa ^ Saaaa ctca fgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 6180 
agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 6240 
g9 ^ gaCag tta ^aatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 6300 
rllnlllr^. ^"^f^a ctccccgtcg tgtagataac tacgatacgg gagggcttac 6360 
cacctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 6420 
cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 6480 
ltttl*lTt gt ^ a "aat tgttgccggg aagctagagt aagtagttcg ccagttaata 6540 
g „"^ 9Caa cgttgttg cc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 6600 
tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 6660 
gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 6720 
cgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 6780 
gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 6840 



WO 02/102984 



PCT/US02/19051 



22/25 



gaccgagttg ctcttgcccg gcgtcaatac 
taaaagtgct catcattgga aaacgttctt 
tgttgagatc cagttcgatg taacccactc 
ctttcaccag cgtttctggg tgagcaaaaa 
taagggcgac acggaaatgt tgaatactca 
tttatcaggg ttattgtctc atgagcggat 
aaataggggt tccgcgcaca tttccccgaa 
cattaagcgc ggcgggtgtg gtggttacgc 
tagcgcccgc tcctttcgct ttcttccctt 
gtcaagctct aaatcggggc atccctttag 
accccaaaaa acttgattag ggtgatggtt 
tttttcgccc tttgacgttg gagtccacgt 
gaacaacact caaccctatc tcggtctatt 
cggcctattg gttaaaaaat gagctgattt 
tattaaacgt ttacaattt 



gggataatac cgcgccacat agcagaactt 6900 
cggggcgaaa actctcaagg atcttaccgc 6960 
gtgcacccaa ctgatcttca gcatctttta 7020 
caggaaggca aaatgccgca aaaaagggaa 7080 
tactcttcct ttttcaatat tattgaagca 7140 
acatatttga atgtatttag aaaaataaac 7200 
aagtgccacc tgacgcgccc tgtagcggcg 7260 
gcagcgtgac cgctacactt gccagcgccc 7320 
cctttctcgc cacgttcgcc ggctttcccc 7380 
ggttccgatt tagtgcttta cggcacctcg 7440 
cacgtagtgg gccatcgccc tgatagacgg 7500 
tctttaatag tggactcttg ttccaaactg 7560 
cttttgattt ataagggatt ttgccgattt 7620 
aacaaaaatt taacgcgaat tttaacaaaa 7680 

7699 



<210> 15 

<211> 7303 

<212> DNA 

<213> Homo sapiens 

<400> 15 

cccattcgcc attcaggctg cgcaactgtt 
tattacgcca gctggcgaaa gggggatgtg 
gggttttccc agtcacgacg ttgtaaaacg 
ttggccatta gccatattat tcattggtta 
attgcatacg ttgtatccat atcataatat 
accgccatgt tgacattgat tattgactag 
agttcatagc ccatatatgg agttccgcgt 
cgaccgccca gcgacccccg cccgttgacg 
ccaataggga ctttccattg acgtcaatgg 
gcagtacatc aagtgtatca tatgccaagt 
tggcccgcct agcattatgc ccagtacatg 
atctacgtat tagtcatcgc tattaccatg 
cgtggatagc ggtttgactc acggggattt 
agtttgtttt ggcaccaaaa tcaacgggac 
ttgacgcaaa tgggcggtag gcgtgtacgg 
gtgaaccgtc agaattcaag cttgcggccg 
gcacagtatg atcagctcag tggatgtgaa 
ctcaccttta gacctaagga cagacctcag 
ccgtgagaag caattgcagc aggaattact 
gcagcttctg atagcagagt ttcagaaaca 
tcagcttcag gagcatatca aggaacttct 
aaaggagcag aaactggagc agcagaggca 
acagcagctt cctcctctca gaggcaaaga 
agaagtaaag cagaagcttc aagagttcct 
aactaatgga aaaaatcatt ccgtgagccg 
ccacacatca ttggatcaaa gctctccacc 
cacattacca ggagcacaag atgcaaagga 
gcccaacttg aaggtgcggt ccaggttaaa 
cttactcagg cggaaggatg gaaatgttgt 
gacagaatcc tcagtcagta gcagttctcc 
gccaactgga agtgttactg aaaatgagac 
gcaaatggtt tcacagcaac gcattctaat 
ttatacctct ccttctttgc ccaacattac 
caatgcttcg aattcactca aagaaaagca 
tgttcctctg cctgggcagt atggaggcag 
tactttagag ggaaagccac ccaacagcag 
gaaagaacaa atgcgacagc aaaagcttct 
gtctcccttg gcaacaaaag agagaatttc 



gggaagggcg atcggtgcgg gcctcttcgc 60 
ctgcaaggcg attaagttgg gtaacgccca 120 
acggccagtg ccaagctgat ctaatcaata 180 
tatagcataa atcaatattg gctattggcc 240 
gtacatttat attggctcat gtccaacatt 300 
ttattaatag taatcaatta cggggtcatt 360 
tacataactt acggtaaatg gcccgcctgg 420 
tcaatagtga cgtatgttcc catagtaacg 480 
gtggagtatt tacggtaaac tgcccacttg 540 
ccgcccccta ttgacgtcaa tgacggtaaa 600 
accttacggg agtttcctac ttggcagtac 660 
gtgatgcggt tttggcagta caccaatggg 720 
ccaagtctcc accccattga cgtcaatggg 780 
tttccaaaat gtcgtaataa ccccgccccg 840 
tgggaggtct atataagcag agctcgttta 900 
cagatctatc gatctgcagg atatcaccat 960 
gtcagaagtt cctgtgggcc tggagcccat 1020 
gatgatgatg cccgtggtgg accctgttgt 1080 
tcttatccag cagcagcaac aaatccagaa 1140 
gcatgagaac ttgacacggc agcaccaggc 1200 
agccataaaa cagcaacaag aactcctaga 1260 
agaacaggaa gtagagaggc atcgcagaga 1320 
tagaggacga gaaagggcag tggcaagtac 1380 
actgagtaaa tcagcaacga aagacactcc 1440 
ccatcccaag ctctggtaca cggctgccca 1500 
ccttagtgga acatctccat cctacaagta 1560 
tgatttcccc cttcgaaaaa ctgcctctga 1620 
acagaaagtg gcagagagga gaagcagccc 1680 
cacttcattc aagaagcgaa tgtttgaggt 1740 
aggctctggt cccagttcac caaacaatgg 1800 
ttcggttttg ccccctaccc ctcatgccga 1860 
tcatgaagat tccatgaacc tgctaagtct 1920 
cttggggctt cccgcagtgc catcccagct 1980 
gaagtgtgag acgcagacgc ttaggcaagg 2040 
catcccggca tcttccagcc accctcatgt 2100 
ccaccaggct ctcctgcagc atttattatt 2160 
tgtagctggt ggagttccct tacatcctca 2220 
acctggcatt agaggtaccc acaaattgcc 2280 
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ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340 
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400 
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460 
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520 
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580 
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640 
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700 
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760 
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820 
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880 
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940 
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000 
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060 
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120 
ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180 
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
ccatcacgct gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
actccatcgc tatgatgaag ggaacttfctt ccctggcagt ggagccccaa atgaggttcg 3540 
gtttatttct ttagagcccc acttttattt gtatctttca ggtaattgca ttgcaggatc 3600 
cggtaccaga ttacaaggac gacgatgaca agtagatccc gggtggcatc cctgtgaccc 3660 
ctccccagtg cctctcctgg ccttggaagt tgccactcca gtgcccacca gccttgtcct 3720 
aataaaatta agttgcatca ttttgtctga ctaggtgtcc tctataatat tatggggtgg 3780 
aggggggtgg tatggagcaa ggggcccaag ttgggaagac aacctgtagg gcctgcgggg 3840 
tctattcggg aaccaagctg gagtgcagtg gcacaatctt ggctcactgc aatctccgcc 3900 
tcctgggttc aagcgattct cctgcctcag cctcccgagt tgttgggatt ccaggcatgc 3960 
atgaccaggc tcagctaatt tttgtttttt tggtagagac ggggtttcac catattggcc 4020 
aggctggtct ccaactccta atctcaggtg atctacccac cttggcctcc caaattgctg 4080 
ggattacagg cgtgaaccac tgctcccttc cctgtccttc tgattttaaa ataactatac 4140 
cagcaggagg acgtccagac acagcatagg ctacctgcca tggcccaacc ggtgggacat 4200 
ttgagttgct tgcttggcac tgtcctctca tgcgttgggt ccactcagta gatgcctgtt 4260 
gaattgggta cgcggccagc ttctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc 4320 
aggctcccca gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccaggtg 4380 
tggaaaagtc cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt 4440 
cagcaaccat agtcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg 4500 
cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct 4560 
cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca 4620 
aaaagctcct cgaggaactg aaaaaccaga aagttaattc cctatagtga gtcgtattaa 4680 
attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc acaattccac 4740 
acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac 4800 
tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc 4860 
tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg 4920 
cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc 4980 
actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt 5040 
gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc 5100 ' 
ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa 5160 
acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc 5220 
ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg 5280 
cgctttctca atgctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc 5340 
tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc 5400 
gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca 5460 
ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact 5520 
acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg 5580 
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt 5640 
ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct 5700 
tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga 5760 
gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa 5820 
tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac 5880 
ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga 5940 
taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc 6000 
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cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca 6060 

gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta 6120 

gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 6180 

tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc 6240 

gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg 6300' 

ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt 6360 

ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt 6420 

cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 6480 

ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc 6540 

gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac 6600 

ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa 6660 

ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct 6720 

tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat 6780 

ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 6840 

cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 6900 

tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 6960 

tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg gggcatccct ttagggttcc 7020 

gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 7080 

gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta 7140 

atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg 7200 

atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa 7260 

aatttaacgc gaattttaac aaaatattaa acgtttacaa ttt 73 03 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 



<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 17 

cactccatcg ctatgatgaa ggg 23 

<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 



<400> 16 

ccatggaaac ggtacccagc aggc 
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<400> 18 

agttcccttc atcatagcga tgg 
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<210> 19 
<211> 20 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Primer used to amplify human DNA 
<400> 19 

aatgtacagg atgctggggt 

<210> 20 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 20 

cccttgtagc tggtggagtt ccctt 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 21 

tgtgtcatcg agctggcttc 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 



<400> 22 

atcttctgca agtggctcca 
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